FIGURE i 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
T CATACAT T AC T GCAG T AACAC TCCACCATATAGACC C GGC T T T AC C T TAT AT CAGT GAC ACT GG 
TACAGTAGCTCCAGAAAAATGCTTATT TGGGGCAATGCTAAATAT TGCGGCAGTTT TAT GCAT TG 

C T AC CAT T TAT G T T C G T T AT AAGC AAG T T CAT GC T C T GAG T C C T GAAG AG AAC G T TAT CAT C AAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATAT GTTTGTTCAGACCATCCT TTCCTACCAAATGCAGCCCAAAAT CCATGGCAAACAAGT C 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTTTTGCACAGTGGCAATTTTGGGACTGATTTAGAACAGAAACTCCATTGGAACCCCGAGG 
ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 
GAGATAT T TGATGAAAGGATAAAATAT TT CTGTAATGAT TAT GAT TC TCAGGGATTGGGGAAAGG 
T T CACAGAAG T TGC T TAT TCTTCTCT GAAAT T T T CAAC C AC T T AATC AAGGC TGAC AGTAACAC T 
GATGAATGCTGATAATCAGGAAACATGAAAGAAGCCATTTGATAGATTATTCTAAAGGATATCAT 
CAAGAAGACTATTAAAAACACC TATGCCTATACTTT TTTATCTCAGAAAATAAAGTCAAAAGACT 
ATG 



FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLS FLPSALVIWTSAA.FI FSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
f*3 CATTCTCAGCCCAAAACAT GGGATCTTAT CCATAGAACAGCTCAT CAGCCGGGTTGGTGTGATT GGAG 

f|| TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
||| CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
if! CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 
AACCATCAGGTTT CT GGGGAAT GAT AAAAAGT GTTACCACTTCAGCATCAGGAAGT GAAAATCT T ACT 
CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
ATATGCTACCAAGGAGAGAATAGAATAC T CCAAAACCTT CAAGGGGAAATAT T TTAATT TT CTT GGT T 
§$% ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
|J| . GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 

ffj GAAGTTT TGGTCCCAACACATT T CCTTCATTC T TGT TGGAATAAT CAT C GT CACAT CCATCAGAGGAT 

O TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
W CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAAT GGCAC CTTGAACTTAAGCCTACTACAGACTGT T AGAGGCCAGTGGTTTCAAAAT TT A 
GAT ATAAGAGGGGGGAAAAAT GGAACCAGGGCCTGACAT T TTATAAACAAACAAAAT GCTATGGT AGC 
ATT TTTCACCTT CATAGCATACT CCT TCCCCGT CAGGTGATAC TATGAC CAT GAGTAGCAT CAGC CAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 
CCTT GAGATTGAC T CATTAAAAT CAGAGACTGTAACAAAAZWU^AAAAAAAAAAAAAGGGCGGCC GCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE A 

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAWCPYTYMSYFLRKVTDTDILALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVS SVLL I RMSMPLE YRT 1 1 TEVLGELQ FNFYHRW FDVI FLVSAL S S I L FL YLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains : 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 



App_ID=l 0063537 



Page 



FIGURE s 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GT CAT GCAGAAT TAT GGGGAT CAC CCT T GT GAGCAAAAAGGCGAACCAGCAGCT GAAT T T CACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATT.CGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
AT TAT CAC CAC C AAAG AT C C CAT AT T C AAC AC T C AAAC T G C AAC AC AAAC AACAGAAT T TAT T G T 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
P GAAAC TAG CAC CAT G T C T AC AG AAAC T GAAC CAT T T G T T G AAAAT AAAG C AG CAT T C AAG AAT G A 

pi AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
fl CAGC T GGTC T T GGAT T T T GC TAT G T CAAAAGGTAT G T GAAGGCC T T CCCT TT TACAAACAAGAAT 

W CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
Ifl T GAGGAAT CAAAGAAAAC T GATAAAAACC CAGAAGAGT CCAAGAGT CCAAGCAAAACTAC CGT GC 

||| GAT GCC T GGAAGC TGAAGT T TAGATGAGACAGAAAT GAG GAG AC AC AC C T GAGGCT GGTT T C T T T 

Sy CAT GC T C C T T AC C C T GC C CCAGC T GG GGAAAT C AAAAGG G CCAAAGAAC CAAAGAAGAAAG T C C A 

CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACC'CTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGT CACT GAGACCAAGGC T T TC TCTAC TGATT CC GCAG C T CAGACCC T TT CT T CA 
GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
kl GCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACTGTCAGCAGGGACTGTAAAC 
f|| ACAGACAGGGTCAAAGTGTT T T C TC TGAACACAT T GAGT T GGAATCAC T GT T T AGAACACACACA 

CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAAT AAAAAC TCTTATAAAT TTCTATT TT TATCTGAGTTACAGAAAT GAT TAC TAAGGAAGATT 
AC T CAGT AAT T TGT T TAAAAAGTAATAAAATT CAACAAACAT T TGC TGAATAGC TAC TAT AT GTC 
AAGT GCT G T GCAAGGTAT T ACAC TC TGT AAT T GAATAT TAT T CC TC AAAAAAT T GCACATAGTAG 
AAC GC TAT C T G GGAAG C T AT T T T T T T CAG T T T T GAT AT T T C T AGC T TAT C T AC T T CCAAAC T AAT 
TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT 
TAT TAT TAACATAC C T AAGAAGTACAT TGT TACCTC TAT AT AC CAAAGCACAT T T TAAAAG T GC C 
AT TAACAAAT G TAT CAC T AGCCC TCC T T T T T CC AACAAGAAGGGAC TGAGAGAT GCAGAAATAT T 
TGTGACAAAAAATTAAAGCATTTAGAAAACTT 
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FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVE TALKAS FE TC S YGWVGDG FWI S RI S PNPKCGKNGVGVL I WKVPVS RQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVIALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPSESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-8 8 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAAT T GCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGC TGCT GC TA 
AAG CAT CAT C AGAAG T G AAC C T GG C AAAC T T AC C T C C CAGC TAT C AC AAT G AGAC C AAC AC AG AC 
ACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC 
TGGACAAAT GGTCTTTT CAGAGACAGT TATCACATC TGTGGGAGACGAAGAAGGCAGAAGGAGCC 
jvfa ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
p ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
p GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
«fe ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
fl ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCAT.CAC 
fjf CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
pi CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
III GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
V| CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
p CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
p TAGAAAT AGCTAAT T TAT TTCCCCAGGT GT GTGCT TTAGGCGTGGGCTGACCAGGCT TCT TCCTA 

CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
M GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
[| T T TGTT C T ACAT GGC T T T GAT AATT GT T T GAGGGGAGGAGATGGAAACAAT GTGGAGT CT CCCTC 

Cl TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
III CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TT T T CT T GAGGCATGCACATC TGGAAT T AAGGT CAAACT AAT TCT CACAT CCCT C TAAAAG TAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGT TAACC TGCAGAAACAGTAC T T AGGTAATT GT AGGGCGAGGAT TATAAATGAAAT T T GC 
AAAAT CAC T TAGCAGCAACT GAAGACAAT TATCAAC CACG T GGAGAAAAT CAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CATGATT G TATAAGCATGC T T T C T T TGAGT TT TAAAT T AT GT ATAAACATAAGT T GCAT TT AGAA 
ATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRL GAT LLCLLLAAAVPTAPAPAP TAT SAPVKPG PALS YPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKAS SEVNLANLPPS YHNETNTDTKVGNNT IHVHRE I HKI TNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPT FVGSRDQDGE ILLPREVPDEYEVGS FMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 




Signal sequence: 



amino acids 1-19 



m 



N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 



Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 



||| 



327-331 



ta- 
rn 



N-myristoylation site. 



amino acids 202-208, 217-223 



Amidation site. 



amino acids 140-144 
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FIGURE Q 

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCC.CTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAG TAT AT T GACCACAAGC T C AGAGGAGT GC AGAACCCT GT AGCC CGC T GCAT CAT GT GC TG T T T 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
. AAGAT T T TAT T AAAGAT AT T T T G T T AAC T C 



FIGURE IP 



RTRGRTRGGCEKVP INTSCNPTAHLVNS S CPGLMCVFQGYSSKGL I QRS VFNLQ I YGVLGL FWTL 
NWVLALGQCVLAGAFAS FYWAFHKPQDI PT FPLI SAFIRTLRYHTGSLAFGAL ILTLVQI ARVI L 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 



m 

5 



in 



Mr 



Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 



pi amino acids 204-231 



N-myristoylation sites. 

amino acids 60-66, 78-84 



Amidation site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 

TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 

CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 

TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 

GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 

CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 

GCGGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 

ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 

CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 

GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 

TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 

CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 

CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 

TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 

GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 

TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 

AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 

GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 

ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 

ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 

CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 

CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 

CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 

CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 

ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 

AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 

TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 

GAC T TCGT GCCTTACT GAGTCTCTAAGACTTT TTCTAATAAACAAGCCAGTGC GTGTAAAAAAAA 



FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVI TLYTMFVTWSALS S I PEQKCNPHLPTQLGNETWAGPEGYETQWWDAPS I 
VGL 1 1 FLLCTL FI S LRS S DHRQVNS LMQTEECP PMLDATQQQQQQVAACEGRAFDNEQDGVT YS Y 
S FFHFCLVLASLHVMMTLTNWYKPGETRKMI S TWTAVWVKICASWAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-2 0 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT TGGAAAAAGACTCCTGTAACCC TC C T CCAGGATGAACCACC TGCCAGAAGACAT GGAGAACG 
C T C TCACCGGGAGCCAGAGC T CCC AT GC T T CT C T GCGCAATATCCATT CCAT CAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCATTGAGAACACATTAGAGAAGGAGGTGATGCAGTATGACTACTATTCTTCATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 

t" AC T GCGCCAT TGGT GGGCAATAGC GT TGACAACGGCAGT GACCAGT GCC T T T T TAC TAGCAAAAG 

P 

0 TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 

0 ! C T T GC CT GGAT TGAGACGT GGT TCC TGGAT T T CAAAGT GT TACC TCAAGAAGCAGAAGAAGAAAA 

III C AGAC T CC T GATAGT T CAGGAT GC T T CAGAGAGGGCAGCACT TAT ACC TGGT GGTCTTTCT GATG 

f j GTCAGT T T TAT T CCCCT CCT GAAT CCGAAGCAGGATCTGAAGAAGC TGAAGAAAAACAGGACAGT 

J 1 ! GAGAAACCACT T T TAGAACTATGAGTACT ACT T T TGT TAAATGTGAAAAACCC T CACAGAAAGTC 



ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 

r 

p CAC T GCT GGCTT TAT T GAACAGCTAATAAAGAT T TAT T TATTGT AATACC T CACAAACGT T G TAC 

III CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 

pi 

*«* TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 

fit 

JiJ AAAAGAC T T AATATAT T GAAGTAACAC T T T TT TAGTAAGCAAGATACC TT T T TAT T T CAAT T CAC 

r .2 

i|| AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 

TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
C T T GC AAAAGGGGAAGAAAGGAAT T GC GAAT ACAT GT AAAAT GT CACCAGACAT T TGT ATT AT T T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACTTAAACCATTCATATCATGTTTCCTTTGCGTTCAGCCAATTTCAATTAAAATGAA 
CTAAATTAAAAA 
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FIGURE id 

MNHLPEDMENJU.TGSQSSHASLRNIHSINPTQIMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWI IELNVNGGIENTLEKEVMQYDYYSSYFDI FLLAVFRFKVLILAYAVCRLRHWWAIALTT 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 



EST 

Ill 



rii 



Important features of the protein: 
Signal peptide: 



H amino acids 1-20 



rs-i Transmembrane domains: 

111 amino acids 54-72, 100-118, 130-144, 146-166 



N-myr i s toyl a ti on s i tes . 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGGGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
fa CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
«*% GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
TGGATGTGTCCGACAACGAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
P CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGG 
|ll TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
C| TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
\}\ TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
p! CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
III CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
M TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
GGCC TGTACT G TGAGAGCCAGAT GGGGCAGGGGACACGG C CCAGCC CT ACACCAGT CACGC CGAG 
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
CI AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
fll GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
pi GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
I*; GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
m CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGAC TCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGAT GC 
T T TAGGAAC AT GTTTTGCTTTT TTAAAAT AT AT AT AT T T AT AAGAGAT CC T T T CCCAT T TAT T C T 
GGGAAGAT GTT T TT CAAAC T CAGAGACAAG GAC TTTGGTTT T TGTAAGACAAACGATGAT AT GAA 
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 



IS 

© 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGS FAGLPGLQLLDLSQNQ IASLPSGVFQPLANLSNLDLTANRLHE I TNETFRGLRRLERLY 
LGKNRI RH I QPGAFDTLDRLLELKLQDNE LRALP PLRLPRLLLLDLSHNS LLALE PGI LDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALT^AV 
LLAAIiAAVGAAYCTORGRAMAAAAQDKGQVGPGAGPLELEGVWPLEPGPKATEGGGEALPSGSE 

CEVPIMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide : 

amino acids 1-23 
Transmembrane domain : 

amino acids 579-599 

EGF-like domain cysteine pattern signature . 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCTTAGATTCCAAGACTACTTTGACATCAGATGAGTCAGTA 
AAGGACCATAC TAC T GCAGGCAGAG TAGTTGC TGGT CAAAT AT T T CT T GATT CAGAAGAAT CT GA 
AT TAGAATCCTCTATTCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGT CACAG 
AAGATATCAGCTTTCTAGAGTCTCCAAATCCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
TTTCCTAGATAAGGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTG 
CTACAACCTATGAC TACAAAGCAGATGAAAAGTGGGGCTT TT GTGAAACT GAAGAAGAGGCTGCT 
AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTAT C AAAC T GGAAT GAAAAT CC T TAAT GGAAG 
CAATAAGAAAAGCCAAAMAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
CCAAAGCCCTGGAGAGAGTGTCATATGCT CTT TTAT TTGGTGAT TAC T TGCCACAGAATAT CCAG 
GCAGCGAGAGAGATGT TT GAGAAGCTGACT GAGGAAGGCTCTCCCAAGGGACAGACTGCTCTTGG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 
AATAATAT TAACATCAGAAGAATTTGTGGT TTATAGCGGCCACAACT T TTTCAGCTTTCATGATC 
CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 

AACACAT GGAATC TACAT GTAAATGAAAGT T GGTGGAG T C CACAATT T TT CT T TAAAAT GAT TAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAPlTGGCTCTTTTTAAATTTTCTCTGAGTTG 

G AAT T G T C AGAAT CAT T T T T TACAT T AGAT TAT C AT AAT T T T AAAAAT T T T T C T T T AGT T T T T C A 
AAATTTTGTAAATGGTGGCTATAGAAAAACAACATGAAATATTATACAATATTTTGCAACAATGC 
CCTAAGAATTGTTAAAATTCATGGAGTTATTTGTGCAGAATGACTCCAGAGAGCTCTACTTTCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT T AAC TCAT T T T T AAT AAAAT T AT GT CTAAGAT T AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAA?W^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 

EYDECTSDGREDGRLWCATTYDYKADEKWGrc^ 

EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S S QAKALVY YT FGALGGNL I AHMVLVS RL 



Important features: 
|,j s Signal peptide: 



amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 



Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C- terminal targeting signal. 

amino acids 299-303 



Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 

AATTCAGATTTTAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCTTCTT 
GTAT T AT ACAAGAAAGG AGT GT ACC TATCACACAC AGGGGGAAAAATGC T C T T T TGGGT GC TAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACAT T T TT ATCACTGGATGT GACT CGGGCT T TGGAAACTTGGCAGCCAGAACTT T T GAT 
AAAAAGGGAT TTCATGTAAT CGCTGCCT GTCTGACTGAAT CAGGATCAACAGCT TTAAAGGCAGA 
AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
CC TG TT T GGACTCATC AG TGTGACAC TAAATATGCT T CCT TTGGTCAAGAAAGC TCAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGGAATCGTTGGAGGGGGCTATACTCCATCCAAA 
T AT GCAG T GGAAGGT TT CAATGACAGCT TAAGACGGGACAT GAAAGCT T T T GGT GTGCACGT CT C 
ATGCATTGAACCAGGAT T GTT CAAAACAAACT T GGCAGAT CCAGTAAAGGTAAT TGAAAAAAAAC 
TCGCCAT T TGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACAT TGAAAAA 
AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 

T TT TCTGGATACC TC TGTC TCACAT GCCAGCAGC T T T GCAAGAC T T T T TAT T GT T GAAACAGAAA. 
GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCAT CTC T T ATC TAAAT AT TAAAAGAT AAGTCAACCCAAAAAAAAAA&AAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 



FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKMTSERLRTVLLDVTDPEOTKRTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 
REPIEWLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKKIA^ 

LSPWECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites, 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGT GAAGCCAAGAACAGCAT TACTGAT T C CCAAATGGATGAT G T TGAAGT T GT T TATACAAT TGA 
CAT TCAGAAATATAT T CCAT GC TATCAGCT TT T TAGCT T T TATAAT TCT T CAGGCGAAGTAAATG 
AGCAAGCACT GAAGAAAAT AT TAT CAAATGTCAAAAAGAAT GT GGTAGGT T GGTACAAAT T CCGT 
CGTCAT TCAGATCAGAT CATGACGT TTAGAGAGAGGCTGCT TCACAAAAACTTGCAGGAGCATT T 
T T C AAACCAAGAC CTTGTTTTTCTGCTAT T AACACC AAGT AT AATAACAG AAAGC TGCT C T AC T C 
ATCGACT GGAACATTCCTTATATAAACCT CAAAAAGGACT TTT TCACAGGGTACCTT TAGTGGTT 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
TTT TAGCC GAGCAGTACAAACACACAGC TC TAAAT T T T T T GAAGAAGAT G GAT CC T TAAAGGAGG 
T ACATAAGAT AAAT GAAAT GT AT GC T T CAT T ACAAGAGGAAT T AAAGAG T ATAT GCAAAAAAGT G 
GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTiVAAACGAGAAATTGA 
GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCT.CAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCT T T AAAAAATAGACATGT T TC TAAAAGTAGC T GTAACTACAACCACCAT C TCGATGTAGT 
AGACAATCTGACCTTAATGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGCTAGTACACCAC 
AAAT CAT T AAGCATAAAGCCT TAGAC T TAGATGACAGAT GGCAAT TCAAGAGATC TCGGT T GTT A 
GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAAT TGAAAAGATGAAGGGTT TTGGTGAATATTCACGGTCTC 
C TACATT T TGAT CC T T TTAACC T TACAAGGAGAT T T T T TTAT T T GGC T GATGGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTTTGCAGTAATACACAGATAACTCTTAGTGCATTTACTTCACAAAGTACTTTTTCAAACATCA 
GATGCT T T TAT T TCCAAACC T T T T T T T CACC TT TCACTAAG T T GT TGAGGG GAAGGCT TACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGAT TGAT TGAAGCTAGGAGT TAGAGACCAGCCTGGGCAACGTATT 
GAGACCATGTC TAT TAAAAAATAAAAT GGAAAAGCAAGAATAGC C T TAT T TT CAAAATATGGAAA 
GAAAT T T AT ATGAAAATT TATC T GAGT CAT TAAAAT T CTCC T TAAGT GAT AC TT T T TTAGAAGTA 
CAT TAT GGCT AGAGT T GCCAGATAAAAT GC TGGATATCAT GCAATAAAT T T GCAAAACAT CAT C T 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 



i 



w 

0| Signal peptide: 

ill amino acids 1-19 
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FIGURE 22 

MEGE S T SAVLS GFVLGALAFQHLNTDS DTEGFLLGE VKGE AKNS I TDSQMDDVEWYT I D I QKY I 
PCYQLFSFYNSSGEVNEQALKKILSNWK]WVGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKS I CKKVEDSEQAVDKLVKDVNRLKRE IEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDI PEAS PAS T PQ 1 1 KHKALDLDDRWQFKRS RLLDTQDKRS KANT G S SNQDKASKMS S PE T 
DEEIEKMKGFGEYSRSPTF 

Important features: 



N-glycosylation sites, 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GG CACAGC C GCG C GGCGGAGGG CAGAGT CAGC C GAGCCGAGT C CAGCC GGACGAGCGGAC CAGCGCAGGGCAGCC CAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCAGCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
I ■< AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTAGCTGCTGCTCT 
r% CAAGAGAT GGAGGAGAAGTACAGC CT GCAAAGT AAAGAT GATTT CAAAAGT GT GGT CAGC GAACAGT GCAATCAT TT G 

f*| CAAGCT GT CTTT G CTT CACGT TACAAGAAGTTT GAT GAATT CTTCAAAGAACTACT T GAAAAT GCAGAGAAAT CCCT G 

ff| AAT GATAT GT TT GTGAAGACATATGGCCAT TTATACAT GCAAAATT C T GAGCTAT TTAAAGAT CT CTT CGTAGAGTT G 

||| AAACGTTACT AC GTGGT GGGAAAT GT GAACCT GGAAGAAAT GCTAAAT GAC TT CT GGGCT CG C CT CCT GGAGCGGAT G 

III TTCCGCCT GGT GAACT C C CAGTACCACT TTAC AGAT GAGT AT CT GGAAT GT GT GAGCAAGTATACGGAGCAGCTGAAG 

[,|J CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
%j TTAGC GGTT G CGGGAGAT GT C GT GAGCAAGGT CTCC GT GGTAAAC C CCAC AGCCC AGT GTACCCAT GCCCT GTT GAAG 

l\ AT GAT CTACT GCT CC CACTG C C GGGGT CTCGT GACT GT GAAGCCAT GTTACAACTACT GCT C AAACAT CAT GAGAGGC 

|3 TGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAACAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTA 
|ll GAGGGTCCTTTCAACATTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 
|3 AATAGT GTTCAAGTGT CT CAGAAGGTTT T C CAGGGAT GTGGAC CCC CCAAGCCC CT CC CAGCT GGACGAAT TT CT C GT 

III TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
AGTTT GGACC GACT GGTTACT GAT GT CAAG GAGAAACT GAAACAG GCCAAGAAAT T CT GGT CCTCC CT TC C GAGCAAC 
f|| GTTTGCAACGAT GAGAGGAT GGCT GCAGGAAACGGCAAT GAGGAT GACT GTT GGAAT GGGAAAGG CAAAAGCAG GTAC 

CT GTTT GCAGT GACAGGAAAT GGATTAGC CAAC CAG GGCAACAAC C CAGAGGT CCAGGTT GACAC CAGCAAACCAGAC 
ATACTGAT CCTTCGT CAAATCATGGCTCTT C GAGT GAT GAC CAGCAAGAT GAAGAATGCATACAATGGGAACGACGTG 
GACTT CTTT GATAT CAGT GAT GAAAGTAGT GGAGAAGGAAGT GGAAGT GGCTGT GAGTAT CAG CAGT GC C CTT CAGAG 
TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 
CAGGCCTACCT CCT CACT GT CTT CT GCAT CTTGTT CCT GGTTAT GCAGAGAGAGT GGAGATAATT CTCAAACTCTGAG 
AAAAAGTGTTCAT CAAAAAGTTAAAAGGCAC CAGTT AT CACTT TT CTACCATC CTAGT GACTT T GCTTTTTAAAT GAA 
TGGAC^CAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAGGGACT GT GCAT T GAGTT GGT TCCTGCTCCCC CAAACCATGT TAAACGT GGCT AACAGT GT AGGTACAGAA 
CTATAGTTAGT TGT GCAT TT GT GATT TT AT CACT CTAT T ATTT GT T T GTAT GTT TTTTT CT CAT TT C GTTT GT GGGTT 
TTTT TTTC CAACT GT GAT CT CGCCTT GTTT CTTACAAGCAAACCAGGGT CCCT T CT TGGCACGT AACAT GTAC GTATT 
T CT GAAAT AT TAAATAGCT GT ACAGAAGCAGGTT T TAT TTATCAT GTTAT CTT ATTAAAAGAAAAAGCCCAAAAAGC 
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ill 
- 

Ill 



FIGURE 24 



MARFGLPALLCTLAVLSAALLAAELKSKSCSETO^ 

CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKS LNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGM 

VTWPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMIWAERLEGPFNIESVMDPIDVKISDAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRI SRS I SESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSOTCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 

N« KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGS.GSGCEYQQCPSEFDYNATDHAG 

jj*f KSANEKADS AGVRPGAQAYLLTVFC I LFLVMQREWR 



Important features: 
Signal peptide: 

amino acids 1-22 



f*H ATP/GTP-binding site motif A (P-loop) 

Ill amino acids 515-524 

Ijf N-glycosylation site. 

1*1 amino acids 514-518 



Glycosaminoglycan attachment sites . 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 



Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 2* 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
C TGAGTCATCCCCAGGGATCAGGAGCCT CCAGCAGGGAACCT TCCAT TAT AT T C TTCAAGCAAC T 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCT GCCAGCAAT TTCT CAAACAAT GT CAGCTAAGAAGC T T TGC TG T GCC T T T G TAG GAGC T CTG 
AGCGCCCACT C T T CC AAT T AAACATT CT CAGCCAAGAAGACAGT GAGCACACC T ACCAGAC AC T C 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGT GTCAT T TAACCT TAAATGCAAT CAGGAAAGTAGCAAACAGAAGT CAATAAAT AT T TT T 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 

MKVLISSLLLLLPLMmSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 



r s-7 



III 



Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 



■ GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGC CGT GCTGAC TCCCAAGAGCAACC GCAAGATGGAGAGCAAGAAAC GG GAGCTCT T CAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGGATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
U GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 



p 

If! 

US GAGCCCACC TGGAAACACAT TGGGGAT GG CT GC TGC CTCACCAGAGAGAC CT GGAAGGAT C T T GA 



111 GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
III GGCCCCACAT CATGGGAAAGGCTGTCAAACAATC TT TCCCAAGCTCCAAGGCAC TCATT TGC T CC 

M 

T TCCCCAGCCTCCAAT T AGAACAAGCCACCCACCAGC C TAT CTAT CT T CCACTGAGAGGGAC CTA 

B 

m GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
HI AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 

AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
AT GC CAGAGCAAGAC T CAAAGAGGCAGAGG T T T T GT TC TCAAATAT T T T TTAAT AAATAGACGAA 
ACCACG 



IP 

ni 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHIimLLGCWQPLCKSYFPYmAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQIJUDGSMDVWCTLVLCSVQSPRKVLQETORVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 



Important features : 

g*= Signal peptide: 

f% 

amino acids 1-23 



||| Leucine zipper pattern. 

||l amino acids 10-32 



m 



N-myristoylation sites, 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 



ill 
ill 

lil 



hi 

if! 
p 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCGCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCT GGGAGATAGGGAACAGAAGAGGGTAGTGGGT GGGC TAGGGGGGCTGCCT TATT TAAA 
GTGGTTGTTTATGATTCTTATACTAATTTATACAAAGATATTAAGGCCCTGTTCATTAAGAAATT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE so 

MLLL T LLLLLLLLKGSCLEWGLVGAQKVS SATDAP I RDWAFFP PS FLCLLPHRPAMTC SQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE ai 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTC TGCGGT GT T TTATGGTGGC TGTATT ATGAC TAT ACCAACGACC T CAGCATAGAAT TGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
M- CTGCTCGTCTT GAT TTTTGTTCT CAGAAAGAGAAT AAAAT T GAC AGT T GAGC T T T TCCAAAT CAC 

P AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 

TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
r[| ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
If! AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
||j TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 

TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
L GAT TC C GAGAAT CAT T GT CATG TACAT GCAAAAC GC AC T GAAAGAAC AGCAGCATGG T GCAT T G T 

CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
r*| AACCAGAATGCATATACTACAACTGCTATTAATGGGACAGATT TC TGTACATCAGCAAAAGATGC 

III AT T C AAAAT C T T G T C C AAG AAC T C AAG T CAC T T TACAT C TAT T AAC T G C T T T GG AGAC T T C AT AA 

n 

Jf; TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
f ^ AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 

CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAATGAGGA 
GGGAACAGAAC T CCAGGCCAT T GTGAGATAGATACCCAT T TAGGT AT C T GT ACCT GGAAAACAT T 
TCCT TCTAAGAGCCAT TTACAGAATAGAAGATGAGACCACTAGAGAAAAGT TAGTGAATTTT TTT 
T TAAAAGACC TAATAAACCCTAT TC T T CCT CAAAA 
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FIGURE 32 

MSGRDTILGLCILAI^SIJ^FTFRFITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIE 
LDTERENMKCVLGFAI VS TG I TAVLLVL I FVLRKRI KLTVELFQ I TNKAI S SAP FLL FQPLWT FA 
ILIFFWLWAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FIIFLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA 
VDLETNDGS SEKPYFMDQE FLS FVKRSNKLNNARAQQDKHS LRNEEGTELQAI VR 

; , 

S*3 Important features: 

Signal peptide: 

fli amino acids 1-20 

III Putative transmembrane domains : 

**l amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

pi N-glycosylation sites. 

ft amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 



App_ID=l 0063537 



Page 182 of 318 



FIGURE 33 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 
GAAAATT TT TT GAAAAAAAAAT T GCC T T CT TCAAACAAGGGTGTCAT T CT GATAT T T ATGA GGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT T C AAACAAAGAAAC G G C AAAGAAGAT T AAAAGG C C C AAG T T C AC T G T G C C T C AGAT CAAC T GC 
GATGTCAAAGC CGGAAAGAT CAT CGAT CCT GAGT TCAT T GT GAAATGT CCAGCAGGATGC CAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGT TACAAAGGGAGT TAT T CCAACGG T GT C CAAT CGT TAT CCCTACCACGAT GGAGAGAATC C T T 
TATCGTCT TAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCAT CAGC T CT TACATAC T CAT CAT 
CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTAT T CCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
p GCCACAGGAGCCAGGAGAT GGATCTC TGGT CCAC TGCCACCTACACAAGCAGCCAAAACAGGCCC 

li\ AGAGC TGAT CCAGGT AT CCAAAGGCAAGAT CC T TCAGGAGCTGCCT TCCAGAAA.CC TGT T GGAGC 

GGATGTCAGCC TGGGACTT G T TC CAAAAGAAGAATTGAGCACACAGTC T T T GGAGCCAGT AT CCC 
T GGGAGATC CAAACTGCAAAAT T GAC T T GT CGT T TT TAAT T GAT GGGAGCACCAGCAT T GGCAAA 
CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
III CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 
Si ACACGAAT T CT CGAGATCT GAAGACAGCCATAGAGAAAAT TACT CAGAGAGGAGGAC T T TC TAAT 

GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
jjU GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
M GACT T GCGAGAGAGT CAGGAATCAACAT T T TC T TCAT CACCATT GAAGG T GCTGC T GAAAAT GAG 

111 AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
P GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
Wl ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 

GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
T GGAGT T TGGG T TCGACAAG TACAGCAGCAAGCC T GACAT CC TCAACGCCAT CAAGAGGGT GGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGT TAAT GATCC TCATCACCGACGGGAGGTCCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGC TAGAAG TCAT TGC CACT CACCCCGCCAGAGACCAC T CC T TC T T TGT GGACGA 
GT T TGACAACC TCCAT CAGTATGTCC C CAGGAT CATCCAGAACAT T T GTACAGAGT TCAACT CAC 
AGCC T CGGAAC TGAA T TCAGAGCAGGCAGAGCACCAGCAAGTGC T GC T T T AC TAAC TGACGT GT T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTC T T GT TATTAT T C T T TGCCAT CATGC T T T T T CATATT CCAAAACT T GGAGTTACAAAGA 
T GATCACAAACGTATAGAAT GAGCCAAAAGGC TACATCATGTT GAGGG T GCT GGAGAT T T TACAT 
T T TGACAAT TGTT TTCAAAATAAATGT TCGGAATACAGT GCAGCCCT TACGACAGGCT TACGTAG 
AGC T T T T GT GAGAT T T TT AAGT T G T TATT TC TGAT T T GAAC T CT GTAACCCT CAGCAAGT T T CAT 
T T T T GT CAT GACAATGTAGGAAT TGC T GAAT TAAATG T T TAGAAGGAT GAAAAATAAAAAAAAAA 
AAA?y^AAAAAAAAAAAAAAAAAAAAAA&AAAAAAAAAAAAAAAAAA2^^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 24 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
C QDPKYHVYGTDVYAS YS S VCGAAVHS GVLDNS GGK I LVRKVAGQS G YKGS Y SNGVQS L S L PRWR 
E S FI VLESKPKKGVT YPS ALT YS S SKS PAAQAGE TTKAYQRPP I PGTT AQPVTLMQLLAVT VAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPOTWVMVDGWPTDK^^ 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMILITDGRSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEVIATHPARDHSFF 
VDEFDNLHQYVPRI IQNICTEFNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Ami da ti on site. 

amino acids 304-308 
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FIGURE 3* 



ill 

III 

ni 

n 

III 



CCGAGC AC AGGAGAT T GCCTGC GT TTAGGAGGT GGCTGCGTT GTGGGAAAAG CT AT CAAGGAAG AAAT TGC 
CAAACC AT GT CT TTT T TT CTGT TT TCAGAGTAGT TCACAACAGATCTG AGT GTT TT AATT AAGC AT GG AAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC 
TGGT GAC CT C CC ACCC TT CAGATGT G AAAGCC AG GC AGGCC ATT AGAGTT AC TT GGGGT GAAAAAAAGT CT 
TGGT GGGG AT AT GAGGTT CTT AC AT TTT TC TT AT T AGGCC AAGAGGCT GAAAAG GAAGACAAAAT GTT GGC 
AT T GTCCT T AGAGGAT GAAC AC CTT CTT T ATGGT GACAT AAT CCGAC AAGAT TT TT T AGACACATAT AAT A 
ACCT GACCT T GAAAAC CAT TAT GGCATT CAGGT GGGT AAC TG AGT TTT GCCC CAAT GCCAAGT ACGT AATG 



ll'l AAGACAGAC AC T GAT GTT TT CATC AAT ACT GGCAAT TT AGTG AAGT AT CT TT T AAACCT AAACC ACTC AG A 

||| GAAGTT TT T C AC AGGT T ATCCT CT AAT T GAT AAT TATTCC TAT AGAGGAT TT T ACCAAAAAACCC AT ATTT 

*?| CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
B TTGGTGCC AAGG AT CT AT GAAATG ATGG GT C ACGTAAAAC CC ATCAAGTT TGAAGATGTT T ATGT C GGGAT 

CT GTTT GAATT T AT T AAAAG TGAAC AT T CAT AT T CC AGAAGAC ACAAATCTT T TCT TTCT AT AT AGAATCC 
ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 
TGGC AGGT CAT G CT AAGGAACACC AC AT GC CAT TAT TAA CT T CACAT T CT AC AAAAAGCC TAGAAG GAC AG 
GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 
AC AC TGAACT GAAACT CAT G AAAAACCC AG ACT GGAGACT GG AGGGTT AC ACT TGT GAT T TATT AGTC AGG 
CCCT TC AAAGAT GAT ATGT G GAGG AATTAAAT AT AAAGGAAT TGGAGGTTT T TGCT AAAGAAAT T AAT AGG 
ACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGGTGTTACTGAGTTATAAGCTCA 
CT AGGC TGT AAAAACAAAAC AATGTAGAGT TTT ATT TATT GAACAATGT AGT CACTT G AAGGTT T T GT GT A 
TAT C TT AT GT GG AT T ACC AATT T AAAAAT AT AT GTAGTT C TGTGTC AAAAAACT TCTT C ACT GAAGTT AT A 
CT GAAC AAAAT T TT ACCT GT TT TTGGT CAT TT AT AAAGT ACT TCAAGATGTT GC AGTATT TCAC AGTT ATT 
ATTATTTAAAATTACTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGATAAAAAGGATAG 
TGAATC AT T C TT T ACATGC AAAC AT TTT CC AGT T ACTT AACT GATC AGTTT ATT ATTGAT ACAT C ACT CCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AAT AT AGAGAAGAAT T AAAGCAAGAAAATCT G AAAA 
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FIGURE 36 

MASALWTVLPSRMSLRSLKWSLL^ 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPL I DNYS YRG FYQKTH I S YQE YP FKVFPP YC S GLGY IMSRDLVPRI YEMMGHV 
KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 



t k Important features: 

P| Type II transmembrane domain: 

ili amino acids 20-39 



tft N-glycosylation sites. 

Ill 

hi amino acids 72-76, 154-158, 198-202, 212-216, 326-330 



Glycosaminoglycan attachment site. 

amino acids 239-243 



Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCaAG GATGG AGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTC CTTGCC AAGAGAGTACAC AGTC AT TAATGAAGCC T GCCC TGGAGCAGAGT GGAATATCATGTGTCGGGAG TGCTGTG 
AAT ATGATC AGAT TGAGTGCGTC TGCCCCGGAAAGAGGGAAGTCG TGGGTTATAC C ATCCC T TGC TGC AGGAATGAGGAGAA 
TGAGTGTGACT C C TGC C TGATCC AC CC AGGT TGTAC CATC TTTGAAAAC TGCAAGAGC TGCCGAAAT GGCTCATGGGGGGGT 
ACC T TGGATGAC T TC TATGTGAAGGGGTTCTAC TGT GCAGAGTGCCGAGCAGGC T GG TACGGAGGAGAC TGCATGC GATGT G 
GCC AGGTTC TGC GAGC C CCAAAGGG TC AGAT T T TGT T GGAAAGC TAT C CCC TAAATGCTCAC TGTGAAT GGAC CAT T CAT GC 
TAAAC CTGGGT T T GT CATCC AAC T AAGAT TTGTCAT GTTGAGTC TGGAGT T T GAC TACATGTGC CAGTATGAC TAT GTTGAG 
GTTCGTGATGGAGACAACCGCGATGGCCAGATCATCAAGC GTGTC TGT GGCAACGAGCGGCCAGC TCC TAT CC AGAGCAT AG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
h h ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 

TATAC TGGGCAGC GC TGTGAAAAT C TCC T TGAAGAAAGAAACTGC TCAGACCC TGGGGGCC CAGTCAATGGGT ACCAGAAAA 
|3 T AAC AGGGGGC CCTGGGCT TATC AACGGACGCCAT GC TAAAAT TGGCAC CG TG GTGT CT TTCT T TTGTAAC AACTCC TAT GT 

l| 1 TC T T AGTGGCAAT GAGAAAAGAAC T TGCCAGC AGAAT GGAGAGTGGT CAGGGAAACAGCCC ATC TGCATAAAAGC C T GCC GA 

hi GAACC AAAGAT T T CAGACC T GGTGAGAAGGAGAGTTC TTC C GATGCAGGT TCAGTC AAGGGAGAC AC CATTACACC AGCTAT 

|j| ACTCAGCGGCCTTCAGCAAGCAGAAACTGCAGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGG 
f i I ATACC AAC ATC T GCATACCCAGC TCCAGTAT GAGTGCATC T CACCCT T C TAC C GC CGCC TGGGC AGCAGCAGGAGGACAT GT 

%- 1 C TGAGGACTGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGCGGGAAAAT TGAGAACATCAC TGCT C CAAAGA 

p C CCAAGGGTTGC GCTGGCCG TGGC AGGC AGC C ATCTAC AGGAGGACCAGC GGGGTGC ATGACGGC AGCC TACACAAGGGAGC 

p GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
lh GTC ACCAT GAT CAAGACAGCAGACC TGAAAGT T GTT TTGGGGAAATT C TACC GGGAT GATGACC GGGATGAGAAGAC CATCC 

p v | AGAGCCTACAGATTTCTGCTATCATTCTGCATCCCAACTATGACCCCATCCTGCTTGATGCTGACATCGCCATCCTGAAGCT 
■Kg b CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG 
p% T CCCACATCAC T GTGGC TGGCTGGAATGTCC T GGC AGACGTGAGGAGCCC TGGCT TC AAGAACGACAC ACT GCGC TC TGGGG 

TGGTC AGT GT GGTGGAC TCGCTGC TGTGTGAGGAGCAGC ATGAGGACCATGG CAT C CCAGTGAGTGT C ACT GATAAC ATGT T 
5 " C TGTGCCAGC T GGGAACCC AC TGC CCC TTCT GATATCTGC AC T GCAGAGACAGGAGGCATC GC GGCT GTGTC C TTC C C GGGA 

C GAGC ATC TCC TGAGCCACGC TGGC ATCTGAT GGGACTGGTCAGC TGGAGC TATGATAAAACATGCAGC CACAGGC TC TCC A 
CTGCCTTCACCAAGGTGCTGCCTTTTAAAGACTGGATTGAAAGAAATATGAAATGAACCATGCTCATGCACTCCTTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGC C AGGGCT TC TGAC T TCAGGGACAAAAC TCAGTGAAGGGTGAGTAGACC T CCAT TGC T GGTAGGC TGATGCC GC GTCCA 
C TAC T AGGACAGCCAAT TGGAAGATGC C AGGGC TT GC AAGAAGTAAGT T TCT TCAAAGAAGACCATAT ACAAAAC C T C TCCA 
C TC CAC TGACC T GGT GGTC T TCCCCAAC T TTCAGT TATAC GAATGCCATCAGC TT GACCAGGGAAGATC TGGGCT T CATGAG 
GCCC C T T T TGAGGCT C TCAAGT TC TAGAGAGC TGC C TGT GGGACAGCCC AGGGCAGCAGAG CTGGGATGTGGTGCAT GCC T T 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAAC TAC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQ I LLE S YPLNAHCEW T I HAKPG FVI QLRFVMLS LE FDYMC QYD YVEVRDGDNRDGQ 1 1 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYST^AFSKQKLQSAPTK 
KPALP FGDLPMGYQHLHTQLQYEC I S PFYRRLGS SRRTCLRTGKWS GRAPSCI P ICGKI ENI TAP 
H 8 KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVWAAHCVTDLGPCVTMIKTADL 
■J* KWLGKFYRDDDRDEKT I QSLQ I SAI I LHPNYDP I LLDAD I AILKLLDKARI S TRVQP I CLAASR 

S DLSTSFQESHITVAGWOTLADVRSPGFKNDT 

||.| SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHIMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
III ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature. 

amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 
474-480, 491-497, 638-644, 666-672 
Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 
CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC 
T GGTCTT GCCT T GGCT CAGT CCT GCTAACTACATT GACAAT GT GGGCAAC CT GCACTT CCT GTATT CAGAACT CT GT A 
AAGGT G CCTC C C ACTAC GGC CT GACGAAAGAT AGGAAGAGG CGCT CACAAGAT GGCT GT C CAGAC GGCTGT GC GAGCC 
T CACAGCCACGGCTCCCT CCCCAGAGGTTT CT G CAG C T GCCAC CAT CT CCTTAATGAC AGACGAGCCT GGCCTAGACA 
ACC CT GCCTAC GT GT C CT CGG CAGAGGAC GGGCAGC CAGCAAT CAGCC CAGTGGAC T CT GGCC GGAGCAACC GAACT A 
G GGCAC GG CCCTT TGAGAGAT C CACTAT TAGAAGCAGATCAT TTAAAAAAATAAAT CGAGCTT TGAGT GTT CT T C GAA 
GGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGC^GGGAAAATTCTGAAAACACCACTGCCCCTGAAG 
TCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTACCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAA 
GCCTCT CT ATTAGGCT GGT GGGAGGT AGCGAAACCC CACT GGT CCATATCATT AT CCAACACATTT AT CGT GAT GGGG 
T GATC GCCAGAGAC GGC C GGCT ACT GC CAG GAGACATCAT T CT AAAGGT CAACGGGATGGACAT CAGCAAT GT CCCTC 
i , £ ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
jf'f GCAGGAACAATGGACAGGCCCCGGATGCCTACAGACCCCGAGATGACAGCTTTCATGTGATTCTCAACAAAAGTAGCC 
s . s CCGAGGAGCAGCTT GGAATAAAACT GGT G CGCAAGGTGGAT GAGCCTGGGGTT T T CAT CTT CAAT GT GCT GGAT GGCG 

GTGTGGCATATCGACATGGTCAGCTTGAGGAGAAT GACCGTGT GTTAGCCATCAATGGACATGATCTT CGATATGGCA 
GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 

si 

p GGAG CCCT GACAT CTT T CAGGAAGCC GGCTG GAACAGCAAT GGCAGCT GGT CC C CAGG GCCAGGGGAGAGGAGCAACA 

lh CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 

y i 

ft\ GCATGACC GT CGCAGGGGGAGCAT CACATAGAGAATGGGATTT GCCTAT CTAT GT CAT CAGTGT T GAGC C CGGAGGAG 

fi l TCAT AAGCAGAGAT GGAAGAATAAAAACAGGTGACATTT T GTT GAAT GT GGAT GGGGT CGAACT GACAGAGGT CAGCC 



0 
m 



GGAGT GAGGCAGT GGCATTATTGAAAAGAAC AT CAT CCT CGATAGTACT CAAAGCTTT GGAAGT CAAAGAGTAT GAGC 
CCCAGGAAGACTGCAGCAGCCCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCT 
GGGT CATGT GGCT GGAAT T ACCACGGTGCT T GT ATAACT GTAAAGATAT T GT AT T ACGAAGAAACACAGCT G GAAGTC 
T GGGCT TCT GCATT GTAGGAG GT TAT GAAGAAT ACAAT GGAAACAAAC CTTTT TT CAT CAAAT C CATT GTT GAAGGAA 
CAC CAGCATACAAT GATGGAAGAATTAGAT GTG GT GATAT T CTT CTTGCT GTCAAT GGTAGAAGTACAT CAGGAATGA 
TACAT GCTT GCTT GGCAAGAC TGCT GAAAGAAC T TAAAGGAAGAATTACTCTAACTATT GTTT CTT GGCCTGGCACTT 
TTTTATAGAAT CAAT GAT GGGTCAGAGGAAAACAGAAAAAT CACAAATAGG CTAAGAAGT T GAAACACTATAT TTATC 
TTGT CAGT T TTTAT AT TT AAAGAAAGAATAC AT T GTAAAAAT GT CAG GAAAAGTAT GAT CAT CT AAT GAAAGCCAGTT 
ACAC CT CAGAAAATAT GAT T CCAAAAAAATT AAAACTACTAGTT TTTT TT CAGT GT GGAGGAT TTCT CATTACT CT AC 
AACATT GT T TATAT TT TTT C TAT T CAATAAAAAGCCCT AAAACAACTAAAATGATT GAT TT GT AT AC CC CACT GAATT 
CAAGCT GAT TTAAATTTAAAATTT GGTATAT GCT GAAGT CT GC CAAGGGTACAT TATGGCCAT T TT TAATTTACAGCT 
AAAAT ATTTTT TAAAAT G CATT GCTGAGAAACGTT GCT TT CAT CAAACAAGAATAAATATT T TT CAGAAGTT AAA 
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FIGURE 4Q 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT I SLMTDEPGLDNPAYVS SAEDGQPAI S PVDSGRSNRTRARPFERS TIRSRS FKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
RE QKFRSRNNGQAPDAYRPRDDS FHVI LNKS S PEEQLG I KLVRKVDE PGVFI FNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGS PE S AAHLI QAS ERRVHLWSRQVRQRS PD I FQE AGWNSNG SWS PG 
PGERSNTPKPLHPT I TCHEKWNI QKDPGESLGMTVAGGASHREWDLPI YVI SVEPGGVI SRDGR 
I KTGD I LLNVDGVE LTE VSRSE AVALLKRT S S S I VLKALEVKE YE PQEDCS S PAALDSNHNMAP P 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites, 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites . 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 



AppJD=l 0063537 



Page 



FIGURE 41 



ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAA.CATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CAC CAGTAACTACTTCGTGGGTGCCAT TCAAGAGATT CGTAAAGCAAAGGAGT TCAT GGCTAAT TTCC 
ATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACTAATGAAGCATCCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTGGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 

T AAAAAGT T TAAT CGAGCC AAACT CTT GAAT GT GGGCT AT CTAGAAGCCCT CAAGGAAGASAATT GGG 

ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 

CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 

GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 

GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 

CCT GAAGT GGGT AAATATACAAT GGTCTT CCACACTAGAGACAAAGGCAAT GAGGTGAACGCAGAACG 

GATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT 

T AGTATCTGTGGAACACAAT CCT TT ATATATCAACATCACAGTGGAT TT CT GGTT TGGTGCATGACCC 

T GGATCT T T TGGT GATGTT T GGAA.GAACT GATT CT T T GTT TGCAAT AAT T T TGGCCTAGAGACT T CAA 

ATAGTAGCACACATTAAGAACCT GTTACAGCT CAT T GT TGAGCTGAAT T TT T CCT TT TTGTATT T TCT 

TAGCAGAGCTCCTGGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGAT 

CATGAGGGTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 

AATGAACGCTAT T TGAGGACTCT GGTTGAAGGAGAT TTAT TTAAATTTGAAGTAATATAT TATGGGAT 

AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 

AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 

GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 

GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 

CTTCAGGGGAGGACCT GCCCAGGTATGCCT TCCAGTGATGCCCACCAGAGAAT ACAT TCT CTAT TAGT 

T TT TAAAGAGT TTTTGT AAAATGATTT TGTACAAGTAGGATAT GAAT TAGCAGTTTACAAGTTTACAT 

ATTAACTAATAATAAATATGTCTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 
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FIGURE 42 

MGFNLT FHLS YKFRLLLLLTLCLT WGWATSNYFVGAI QE I PKAKE FMANFHKTL I LGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINI TVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites . 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE A* 

GC TCAAGACC CAGCAGTGGGACAGC CAGACAGACGGCACGATGGCAC T GAGC TCCCAGAT CTGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 

MALSSQIWAACLLLLLLIASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CIFCCGCCHRSKCGMCCKT 



Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE 45 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAAT ATG GCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAAT GACT CAGGGAT CT AC TATG TGGGGATATACAGCT CAT CAC T CCAGCAGCCC T CCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACCTGTGT GACCAATCTGACATGCT GCAT GGAACATGGGGAAGAGGATGT GATT 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
C TGGAGAT GGGGAGAAAGT GATATGACC T T CAT CTGCGT T GCCAGGAACCC TGT CAGCAGAAAC T 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GT T TC TGAAGAGAGAGAGACAAGAAGAGT ACAT TGAAGAGAAGAAGAGAGT GGACAT T TGTCGGG 
AAAC T CCTAACATAT GCCCCCAT TC TGGAGAGAACACAGAGT ACGACACAATCCC TCACACT AAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGTT TACT CCACTGTGGAAATACCGAAAAAGAT 
GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE 46 



MAGSPTCLTLIYILWQLTGSAA.SGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 

EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 

PKVTMGLQSNKNGTCVTNLTCGMEHGEEDVIYTWKZU.GQAANESHNGSILPISWRW 

VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 

EKKRVDI CRE T PN I C PHS GENTE YDT I PHTNRT I LKE DPANT VYS T VE I PKKMENPHS LL TMPDT 

PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 



PI Leucine zipper pattern. 

111 amino acids 229-251 



N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAAT T GT GAATT TTCATTGAAAAACAT CAGT GACAT TCATCCAGAATCCT TCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACAC CAT GGC GAGTGGC T GG AGAGCAT C T AGT T TCCAC T T CGAT T C TGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T G TAG T T T AAT G GGAAT AAAAT G TAAG T AT CAG TAG T T T GAAAAAAAAAAA 
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FIGURE 48 



MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
T TMS LTARKRACCNNRT GMFL S S FFS VI T VI GAL YCML I S I QALLKGPLMCNS P SNSNANCE FS L 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features : 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 



amino acids 223-227 




N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 



Prokaryotic membrane lipoprotein lipid attachment site. 



amino acids 207-218 



TNFR/NGFR family cysteine-rich region protein. 



amino acids 4-12 
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FIGURE 4Q 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGT GAGGAACCTGCAGAGTCACACAGTTGC TGACCAATTGAGCTGTGAGCC TGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
j** CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
0 TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
pi TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 



ill 



ru 

III 
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FIGURE so 

MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
SQKQHSPVPEKAIPLITPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE si 

GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 

AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 

AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 

GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 

CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 

GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 

GCCC T TGGCCAAGGGACCAGAGAAGCAGT T GGCACT GGAGT CAGGCAGGT T CCAGGC T TT GGCGC 

AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 

TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 

GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 

CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

GAAACT CAGCAGGCAGCT T TGGAAT GAAT C CTCAGGGAGC TCCC TGGGGT CAAGGAGGCAAT GGA 

GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 

AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 

ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 

AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 

TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 

CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 

CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 

GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 

GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 

GAAT T T T AAATCCAAGC T GGGT T TCAT CAACTGGGAT GCCAT AAACAAGGACCAGAGAAGC TC T C 

GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 

AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAkA&AAA^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 52 



o 
III 
p 
ill 

III 



MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
E ALGQGTREAVGT GVRQVPG FGAADALGNRVGEAAHALGNTGHE I GRQAEDVI RHGADAVRGS WQGVP 
GHSGAI^TSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDT FWKNFKSKLGFINWDAINKDQRS SRI P . 



t 

|Jl Signal peptide: 

M amino acids 1-21 

III 

N-glycosylation site. 

Ill amino acids 2 65-269 

r.H 

£ Glycosaminoglycan attachment site. 



amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 
Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 
N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence . 

amino acids 301-304 
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FIGURE 53 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 

CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 

ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 

CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 

TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 

CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 

TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 

AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 

GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 

CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA 

CAGAAATGCATCTTCAGCTTTGACAGCCATTGTCAGGAGAGGCCCAGTGAATATATTGCCACCAT 

CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 

ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 

GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 

AGCCAAGTCCAAGAC T T T GGATT TCAT TGAT GT GCT TCT GCTGAGCAAGGAT GAAGATGGGAAGG 

CATTGT CAG ATGAGGATATAAGAGCAGAGGCT GACACCT T C AT GT T T GGAGGCCAT GACACCACG 

GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 

ACAGGAGGTGCAAGAGCT TCTGAAGGACCGCGATCCTAAAGAGAT TGAATGGGACGACCTGGCCC 

AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 

TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 

CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 

CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 

GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 

GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 

GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC, 

CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 54 

MSLL S LPWLGLRPVAMS P WLLLLLWGS WLLARI LAWT YAFYNNCRRLQC FPQPPKRNWFWGHLG 
L I TPTEEGLKDS TQMS AT YS QGFTVWLGP 1 1 P FI VLCHPDT IRS I TNASAAI APKDNLFIRFLKP 
WLGEG ILLS GGDKWSRHRRML TPAFHFNI LKSYITI FNKSANIMLDKWQHLASE GS SRLDMFEH I 
SLMTLDSLQKC I FS FDSHCQERPSE YIAT I LELSALVEKRSQH I LQHMDFLYYLS HDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
LAF I PFS AGPRNC I GQAFAMAEMKVVLALMLLHFRFLPDHTEPRRKLEL IMRAEGGLWLRVE PLN 
VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites . 

amino acids 112-116, 168-172 
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FIGURE 

ATCGCAT CAAT TGGGAGTACCATCT TCCT CATGGGACCAGTGAAACAGCTGAA.GCGAATGT TTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACrTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAA TT CAT GGCCAGT T T TAT GAAGCT T T GGAAGGCACT AT GGACAGAAGC T GGTGGACAGT T T T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GT GATT CGAACAT T T GAGGGT TACT T T TGGAAGCAACAAT ACAT TCTCGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
T C T GGAT GT T GT CCC AC T GAAT TCCCAT GAAT AC AAAC C T AT TCAGC AACAGCAAAAAAAAAAAA 
AAAA2AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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■ FIGURE s6 

MGPVKQLKRMFEPTRLIATIMVLLCFALTLCSAFWWHNKGLALIFCILQSLALTWY^ 
DAVKKC FAVC LA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen-binding domain protein. 

amino acids 30-72 
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FIGURE 52 

CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 

TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 

ATTTTCAAGCT CAGTGTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGT GGAAGCAGAAAATT GTACAAGCT 

GGAGATAAGGAC CT T GAT GG GCAGCTAGACTTT GAAGAATTT GT CCAT TATCT CCAAGAT CAT GAGAAGAAG C T GAGG 

CT GGT GTTTAAGAT TTT GGACAAAAAGAAT GATG GACGCATT GACGCGCAGGAGAT CAT GCAGT C CCT GCGGGACTT G 

GGAGT CAAGATATCT GAACAGCAGGCAGAAAAAATTCTCAAGAGCATGGATAAAAACGGCACGATGACCATCGACTGG 

AACGAGTGGAGAGACTACCACCTCCTCCACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACG 

ATCTTTGATGTGGGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTGGAGA 

CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 

C AG GT C CAT G C CT C C C GC AG CAAC AAC AT GG G CAT C GT T GGT G GCT T C ACT CAGAT GAT T C GAGAAG GAGGG G C CAG G 

T CACT CT G GCGGGGCAAT GGCAT CAAC GT CCT CAAAAT T GCC CCCGAAT CAGC CAT CAAATT CAT GGCCT AT GAGCAG 

ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 

AT C GCCCAGAGCAGCAT CTACC CAATGGAGGT CCT GAAGACCC GGAT GGCGCT GCGGAAGACAGGCCAGT ACT CAGGA 

ATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGGTGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTG 

GG CAT CAT CC CCT AT GCCGGCAT C GAC CT T GCAGT CTACGAGACGCT CAAGAAT GCCT GGCT GCAGCACT AT GCAGT G 

AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 

CCCCT GGC CCTAGT CAGGAC CC GGAT GCAGGCGCAAGC CT CTATT GAGGG CGCTCCGGAGGT GAC CAT GAGCAGCCT C 

TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 

GCT GT GAGCAT C AGCTACGT GGT CTAC GAGAACCT GAAGAT CACC CT GGGC GT GCAGT CGCGGTGACG GGGG GAGGGC 

CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 

AAGCT GT CT C GAGCCAAGCT GT GAAAACCCTAGACGCACC C GCAGGGAGGGT GGGGAGAGCT GGCAGGCCCAGGG CT T 

GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 

CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 

TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 

CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 

ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 

TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 

CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 

GGC CT GGACC CT GTCAGGAT GGGCCCCAC CT CAGAACCAAACT CACT GT C C CCACT GT GGCAT GAGGGCAGT GGAGCA 

CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 

ACT GTT GGGAAAAGGGTTTTGTCCAGAAGGACAAGCCGGACAAAT GAGCGACTTCTGTGCTTCCAGAGGAAGACGAGG 

GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 

GGGAC CAGCC CCACAT T C CACTT GT GT CACT GCT T G GAACCTATTTATTT T GTATT TAT TT GAACAGAGTTAT GT C CT 

AACTATT TTTAT AGAT TT GTTTAATTAATAGCT T GT CATTT TCAAGTT CATTTT TTATT CATAT TT AT GTTCAT GGTT 

GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 

GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 

GTT GGGGAAGGGCT T GCCCCCAG C CT TAGGATTT CAGGGT TTGACT GGGGGCGT GGAGAGAGAGGGAG GAAC CT CAAT 

AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 

TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 

CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 

T G AGG AAAT C T T CAAT AG GAT G CAAAGAT CAAT G C AAAAATT GT TAT AT AT GAAC AT AT AAC T G GAGT C GT C AAAAAG 

CAAATTAAGAAAGAAT T GGAC GT TAGAAGTT GT CATTTAAAGCAGCCTT CTAATAAAGT T GTT T CAAAGCT GAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE *8 



MLCLCLYVPVI GEAQTE FQY FE S KGLPAELKS I FKLS VFI PS QE FS T YRQWKQKI VQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQE IMQSLRDLGVKI SEQQAEKILKSMDKNG 
TMT IDWNE WRD YHLLHPVENI PE I ILYWKHS T I FDVGENLTVPDE FTVEERQTGMWWRHLVAGGG 
AGAVS R TC TAPLDRLKVLMQVHASRSNNMG I VGGFT QMI REGGARS LWRGNG INVLK I APE S AI K 
F^YEQIKRLVGSDQETLRIHERLVAGSIAGAIAQSSIYPMEVLKTR3)1ALRKTGQYSGMLDCARR 
I IxAREGVAAFYKGYVPNiyjLG 1 1 PYAGI DLAWETLKNAWLQHYAVNSADPGVFVLLACGTMS STC 

f, ? i» GQLAS YPLALVRTRMQAQAS I EGAPEVTMS SLFKHI LRTEGAFGL YRGLAPNFMKVI PAVS I S YV 

P VYENLKITLGVQSR 

0 
St 

l's Important features: 

Iff Signal peptide: 

|l| amino acids 1-16 

1„ Putative transmembrane domains: 

Irl 

|P amino acids 284-304, 339-360, 376-394 

s 

III Mitochondrial energy transfer proteins signature. 

M amino acids 206-215, 300-309 



m 



N~glycosylation sites. 

amino acids 129-133, 169-173 



Elongation Factor-hand calcium-binding protein, 

amino acids 54-73, 85-104, 121-140 
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FIGURE 



GGAA.GGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAA.CCTTCCCCAGCCATGGC 

TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 

CAC T CAT CAT T GGC T T T GGTAT T T CAGGGAGACAC TCCATCACAGTCAC TACT GTCGCCT CAGCT 

GGGAACATT GGGGAGGAT GGAAT CC TGAGCT GCAC T T T T GAACC T GACAT CAAAC T T TC T GAT AT 

CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 

AGC TGT C GGAGCAGGAT GAAAT GT TCAGAGGCCGGACAGCAGT GT T TGC T GAT CAAGTGAT AG T T 

GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 

CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 

AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 

CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 

CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 

TCAACAACACATACTCCT GTATGAT TGAAAATGACATTGCCAAAGCAACAGGGGATATCAAAGTG 

ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 

CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 

GTGC CT T GGCCACAAAAAAGCAT GCAAAGT CAT TGT TACAACAGGGAT C T ACAGAACT AT T T CAC 

CACCAGATATGACCTAGTTTTATATTTCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTG 

AGC AAACAAG AGC AAG AAAC AAAAAGAAG CCAAAAG C AG AAG GC T C C AAT AT GAACAAG AT AAA.T 

C TATC T T CAAAGACAT AT T AGAAGT TGGGAAAATAAT T CAT GT GAAC T AGACAAGTGTG TT AAGA 

GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 

CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 

TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 

AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 

GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 

T C T T CCCAACT GACAAAT GCCAAAGT TGAG AAAAAT GAT CAT AAT T T T AGC ATAAACAG AGCAGT 
CGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVWDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 
NTSFELNSEOTTMKWSVLYWTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 

CVS S FFAI S WALLPLS PYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 6l 



TGACGTCAGAATCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 

AT T GGT CTT C AT T C AAT GAT GAGACC T GCCT CATGAT GAT AAACAT GT T T GAC AAGACC AAGT CA 

GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 

CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 

AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 

GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 

GGCC T T C CGGGAGAAGGACAC AGC TGT ACAAGGCAACAT CCGGC TCAGCT T CGAGGACT T CG T CA 

CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 

TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 

ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 

AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA 

ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 

AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 

GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 

CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 

CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 

CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 

TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 

AT TTGGGGCCAAAAGT CCAGTGAAAT TGTAAGCT TCAATAAAAGGATGAAACTCTGA 



FIGURE 62 

MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
S G Y I SMKE LKQAL VNCNWS S FNDE T CLMM I NMFDKTKS GR I DVYG FS ALWKFI QQWKNL FQ Q YDR 
DRS GS I S YTELQQALS QMGYNLS PQFT QLLVSRYCPRSANPAMQLDRFI QVCTQLQVLTEAFREK 
DTAVQGNIRLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 

TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 

AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 

TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 

GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 

GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 

AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 

TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 

TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 

CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 

TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 

AT C T GGGGTAT CAATAAT CAACCCC CAGGTACCAGC TGGGGAAATAT T AAT CGG TATCCAGGAGG 

CAGC TGGGGAAATAT TAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 

GCAGGTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 

CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 

GGGC TAGAGCACGAT AGAGGGAAACCCAACAT TGGGAGT TAGAGTCC T GC T CCCGCCC C T T GC T G 

TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 

T G C T CAT C T C C AAT AAAAT AAPsAG C AC T TAT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 



FIGURE 64 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAV^ 

LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6 s 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 



MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNS S YES S FLELLEKLCLLLHLPSGTS VTL 
HHARSQHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 

N-myristoylation sites . 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



AC GGACCGAGGGT TCGAGGGAGGGACACGGACCAGGAACCT GAGC TAGGT CAAAGACGCCC GGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 

AAATGAG AAT AGC AC TGTTTTGCCT TCAT CC ACCAGC T CCAGC T CCGAT GGCAAC CTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
u GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
g| GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
P- GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
& I GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
}1f AGACTCT GCCCC TGAGGT CAAGAGAGGAT GGGGC TATT CACT T T TATATAT T TATATAAAAT TAG 

0 £ 

T AGTGAGAT GT AAAAAAAAAAAAAAAAAA 

- a 

P 

ru 
» 
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FIGURE 68 

MANPGLGLLIJ^GLPFLLARWGRAWGQIQTTSi^ENSTVLPSSTSSSSDGNLRPEAITAIIVVFS 
LLAALLIAVGIALLVRKLREKRQTEGTYRPSSEEQFSHAAEi^APQDSKETVQGCLPI 



Important features: 
Signal peptide: 

amino acids 1-19 

|*| Transmembrane domain: 

amino acids 56-80 

tt% 

h? - 

Lli 

1 1| N-glycosylation site. 

|,|| amino acids 36-40 

s cAMP- and cGMP-dependent protein kinase phosphorylation site. 

P amino acids 86-90 

B.! Tyrosine kinase phosphorylation site. 

Cl amino acids 86-94 

I' Si s 

N-myristoylation sites. 

amino acids 7-13, 2 6-32 
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FIGURE 6q 

GCCAGGAAT AACTAGAGAGGAACAATGGG GTT ATT CAGAGGT TTT GT TT T C CT CTTAGT T CT GT GCCT GCTGCACCAG 
T CAAATACT T CCTT CATTAAGCT GAAT AATAAT GGCTT T GAAGATATT GT C ATT GTTATAGAT C CTAGT GTGCCAGAA 
GATGAAAAAAT AAT T GAACAAAT AGAGGAT AT GGT GACTACAGCT TCTAC GTACCTGTT T GAAG C CACAGAAAAAAGA 
TTTTTTTTCAAAAATGTAT CTATAT TAATT CCT GAGAATT GGAAGGAAAATCC T CAGT ACAAAAGGCCAAAACAT GAA 
AAC CATAAACAT GCT GAT GTTATAGTT GCACC ACCTACACTCC CAGGTAGAGAT GAAC CATACAC CAAGC AGT T CACA 
GAATGTGGAGAGAAAGGCGAATACATTCACTTCACCCCTGACCTTCTACTTGGAAAAAAACAAAATGAATATGGACCA 
CCAGGCAAACTGTT T GT C CAT GAGT GGGCT CACCT CCGGT GGGGAGT GT TTGATGAGTACAAT GAAGAT CAGCCT TT C 
T AC CGTGCT AAGT CAAAAAAAAT CGAAGCAACAAGGT GTT C C GCAGGT AT CT CT GGTAGAAATAGAGTTT ATAAGTGT 
CAAGGAGGCAGCT GT CTTAGT AGAGCAT GCAGAAT TGATT CTACAACAAAACT GTAT GGAAAAGATT GTCAATT CTTT 
CCT GAT AAAGTACAAACAGAAAAAGCAT CCATAAT GT TT ATGCAAAGTATTGATT CT GTT GTT GAATT TT GT AACGAA 
AAAACCCATAAT CAAGAAGCT CCAAGC CTACAAAACAT AAAGT GCAATTTTAGAAGTACAT G GGAGGT GATTAGCAAT 
TCT GAGGATTTT AAAAACACCATACCCAT GGT GACACCACCT C CT CCACCT GT CT T CT CAT T GCT GAAGAT CAGT CAA 
AGAATTGT GTGCTTAGTT C TT GATAAGT CT GGAAGCAT GGGG GGT AAGGACCG CCTAAAT C GAAT GAAT CAAGCAGCA 
AAACATT T CCT GCTGCAGACT GT TGAAAAT GGATCCT GGGT GGGGAT GGT T CACTT T GATAGT ACT GCCACTATT GT A 
AATAAGCTAAT CCAAATAAAAAGCAGT GAT GAAAGAAACACACTCAT G GCAGGATTAC CTACAT AT C CT CT GGGAGGA 
ACTTCCATCTGCTCT GGAATTAAATAT GCATTTCAGGTGATT GGAGAGCTACATTCCCAACTCGAT GGATCCGAAGTA 
CT GCT GCT GACT GAT G GGGAGGATAACACT G CAAGTT CTT GTATT GAT GAAGT GAAACAAAGT GGGGC CATT GTT CAT 
TT T AT T GCTTT GGGAAGAGCT GCT GAT GAAGC AGTAATAGAGATGAG CAAGAT AACAGGAGGAAGT CATTT TTAT GTT 
T CAGAT GAAG CT CAGAACAAT GGCCT CATT GAT GCTT TTGGGGCT CTT ACAT CAGGAAAT AC T GAT CT CT CC CAGAAG 
TCCCTT CAGCTCGAAAGTAAGGGATTAACACT GAATAGTAATGCCTGGATGAACGACACT GTCATAATTGATAGTACA 
GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
ATAAT GGAAAATTTCACAGT GGAT GCAACT T C CAAAAT GGC CT AT CT CAGT ATT CCAGGAACT GCAAAGGT GGGCACT 
T GGGCAT ACAAT CTT CAAGC CAAAGCGAACC CAGAAACATTAACTAT T ACAGT AACTT CT CGAGCAGCAAATT CT T CT 
GT GC CT CCAAT CACAGT GAAT GCTAAAAT GAAT AAGGAC GTAAACAGTTT CCCCAGCC CAATGATT GTTT ACG CAGAA 
AT T CT ACAAGGATAT GTAC CT GT T CT TGGAGC CAAT GT GACT GCT TT CATT GAAT CACAGAAT GGACAT ACAGAAGTT 
T T G GAACT T TT GGATAAT GGT GCAGG CGCT GAT T CTT T CAAGAAT GAT GGAGT CT ACT CC AGGTAT TT TACAGCATAT 
ACAGAAAAT GGCAGATATAGCTT AAAAGTT C GGGCT CAT GGAGGAGCAAACACT GCCAGG C TAAAATTAC G GCCT C CA 
CT GAATAGAGCC G C GTACATACCAGGCT GGGT AGT GAAC GG GGAAATT GAAGCAAACC CGCCAAGACCT GAAATT GAT 
GAGGATACTCAGACCACCTT GGAGGATTTCAGCCGAACAGCAT CCGGAGGT GCATTTGTGGTATCACAAGTCCCAAGC 
CTT CC CTT GCCTGAC CAAT ACCCACCAAGT CAAAT CACAGACCTT GAT GC CACAGT TCATGAGGATAAGAT TATT CTT 
ACAT GGACAGCACCAGGAGAT AAT TT TGAT GTT GGAAAAGT TCAAC GTT AT AT CAT AAGAATAAGT GCAAGTAT T CTT 
GAT CTAAGAGACAGTT TT GAT GAT GCTCTT CAAGTAAATACT ACT GAT CT GTCACCAAAGGAGGCCAACT CCAAGGAA 
AGCTTTGCATTTAAACCAGAAAATAT CT CAGAAGAAAATGCAACCCACATATTTATTGCCATTAAAAGTATAGATAAA 
AGCAATTT GACAT CAAAAGTATC CAACATT GCACAAGTAACTT T GTTTAT CCCT CAAGCAAAT CCT GATGACATT GAT 
CCTACACCTACTCCTACTCCTACTCCTACTCCTGATAAAAGTCATAATTCTGGAGTTAATATTTCTACGCTGGTATTG 
T CT GT GATT GGGT CT GTT GT AAT T GT TAACTTTATTTTAAGTACCACC AT TTGAAC CTTAAC GAAGAAAAAAAT CTT C 
AAGTAGACCTAGAAGAGAGTT TTAAAAAACAAAACAAT GT AAGT AAAG GATAT TT CT GAAT CTTAAAAT T CAT CCCAT 
GTGT GAT CAT AAACT CAT AAAAATAATT TT AAGAT GTCGGAAAAGGATACTTT GAT TAAAT AAAAACACT CAT GGATA 
T GTAAAAACT GTCAAGATTAAAAT TTAAT AGTTT CATTTATTT GT T AT TTTATTTGTAAGAAATAGT GAT GAACAAAG 
ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAATGATATTTCAAATTGCATCAA 
GAAATTAAAATCATCTATCTGAGTAGT CAAAAT ACAAGTAAAGGAGAGCAAATAAACAACATTT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 70 



MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
I HFT PDLLLGKKQNE YGPPGKL FVHEWAHLRWGVFDE YNEDQP FYRAKSKKI EATRC SAG I S GRN 
RVYKCQGGSCLSRACRI DSTTKLYGKDCQFFPDKVQTEKAS IMBMQS I DSWEFGNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWGMVHFDSTATIWKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
I TWNSLPPS I SLWDPSGT IMENFTVDAT SKMAYLS I PGTAKVGTWAYNLQAKANPETLT I TVTSR 
AANS SVPPITWAKMNKDVNSFPSPMI VYAE I LQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 
GADS FKNDGWSRYFTAYTENGRYS LKVRAHGGANTARLKLRPPLNRAAY I PGWWNGE I EANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYIIRISASILDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 

VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 



FIGURE 71 



CT CCTTAGGT GGAAACC CT GG GAGT AGAGT ACTGACAGCAAAGACC GGGAAAGAC CATAC GT CCCCGGGCAGGGGT GA 
CAACAG GTGT CAT CTTTTT GAT CT CGT GT GTGGCT GCCTT CCT AT TT CAAGGAAAGACGC CAAGGT AATTTT GAC CCA 
GAGGAGCAAT GATGTAGCCACCT C CTAACCTT CC C TT CTT GAACCCC CAGTTAT GCCAGGATT TACTAGAGAGT GTCA 
ACTCAACCAGCAAGC GGCT C CTTC GGCTTAACTT GTGGTT GGAGGAGAGAACCTT T GT GG GGCT GCGT T CTCTTAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCT GT GAT GGGAAT GAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTT CTGCCTGCAAGAT CAT CCTTTAAA 
AGTAGAGAAGCT GCT CT GT GTGGT GGTT AACT CCAAGAGG CAGAACT C GTT CT AGAAG GAAAT GGAT GC AAGCAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GT TT T GCT GGTGCT C CT CT GCT GT GCTAT CTCT GT CCT GTACAT GTT GGC CT G CACCC CAAAAGGT GAC GAGGAGCAG 
CT GGCACT GC CCAG GGC CAACAGC CCCACGGGGAAG GAGGGGT AC CAGGC CGT CCT TCAGGAGT GGGAGGAG CAGCAC 
CGCAACT AC GTGAGCAGCCT GAAGCGGCAGAT CGCACAGCTCAAGGAGGAGCT GCAGGAGAGGAGT GAGCAGCT CAGG 
AAT GGGCAGT AC CAAGC CAGCGATGCT GCT GGC CT GGGTCT GGACAG GAG CCC CCCAGAGAAAAC CCAGG C CGACCT C 
CT GG CCT T CCT GCACTC GCAGGT GGACAAGGCAGAGGT GAAT GCT GGC GT CAAGCT GGCCACAGAGT AT GCAGCAGT G 
CCTT T CGATAGCTTTACT CTACAGAAGGTGTAC CAGCT GGAGACT GGC CT TAC CCG CCACCCC GAGGAGAAGCCT GT G 
AGGAAGGACAAGCGGGAT GAGT T GGT G GAAG C CAT T GAAT CAGCCTT G GAGACCCT GAACAAT CCT G CAGAGAACAG C 
C CCAAT CACC GT C CTT ACAC GGC CT CT GATTT CAT AGAAG GGAT CTAC CGAACAGAAAGGGACAAAGGGACAT TGTAT 
GAGCT CAC CT T CAAAGGGGACCACAAACAC GAATT CAAAC GGCTCAT CTT AT TTCGACCAT TCAGCC CC AT CAT GAAA 
GT GAAAAATGAAAAGCT CAACAT GGCCAACAC GCT TAT CAAT GTTAT C GT GCCTCTAGCAAAAAGGGT GGACAAGT T C 
CGGCAGTTCATGCAGAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGG 
AAAGAAGAAAT AAAT GAAGTCAAAGGAATACTT GAAAAC ACT T CCAAAGCT G CCAACTT CAGGAACTTT ACCTT CAT C 
CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TTTTTCTGTGATGTGGACATCTACTTCACATCT GAATTCCT CAATACGTGTAGGCT GAATACACAGCCAGGGAAGAAG 
GTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAA 
CAGCAGCT GGT CATAAAGAAGGAAACTG GATTTTGGAGAGACTTT GGATT T GGGAT GACGT GT CAGTAT CGGT CAGAC 
TTCATCAATATAGGTGGGTTTGATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTC 

CACAG CAAC CT CATAGTGGTACG GACGCCT GTGCGAGGACT CTTC CAC CT CTGGCATGAGAAGCGCT GCAT GGACGAG 
CTGAC CCCCGAGC AGTACAAGAT GT GCAT GCAGT C CAAG G CCAT GAAC GAGGCATCCCACGGCCAGCT G GGCAT G CTG 
GTGTT CAGGCACGAGAT AGAGGCT CACCTT C GCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATGAACT CC C^\GA 
GAAGGATT GT GGGAGACACTT TTT CT TT C CT TT T GCAAT TACT GAAAGT GGCT GCAACAGAGAAAAGACTT C CAT AAA 
GGACGACAAAAGAATT GGACTGATGGGTCAGAGATGAGAAAGCCTCCGATTTCT CTCT GTT GGGCTTTTTACAACAGA 
AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 
AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
AAT AT TCAT GATT T AAGAGCAGT TTT GT AAAAAATT CATTAGCATGAAAGGCAAGCAT ATT T CT CCT CATAT GAAT GA 
GCCTATCAGC AGGG CT CTAGT TT CTAGGAAT GCTAAAATAT CAGAAGGCAGGAGAGGAGATAGGCTTAT TAT GATACT 
AGT GAGTACAT TAAGT AAAATAAAAT GGACC AGAAAAGAAAAGAAACCATAAATAT CGTGT CAT ATTT T CCC CAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TT TTTT CCCTT GT GAGTT ATAGT CT GCT T ATT T AATTAC CACTTTGCAAGCCTTACAAGAGAGCACAAGTT GGCCTAC 
ATTT TT AT ATTTT T T AAGAAGATACTT T GAGAT GCAT TATGAGAACTT T CAGTT CAAAGCAT CAAATT GAT GCCATAT 
C CAAGGACATGC CAAAT GCTGAT T CT GT CAGGCACT GAATGT CAGGCAT TGAGAC ATAGGGAAGGAAT GGT TT GTACT 
AATACAGACGTACAGAT ACTTT CT CT GAAGAGT ATTTT C GAAGAGGAGCAACT GAACACTGGAGGAAAAGAAAATGAC 
ACT TTCT GCTT T ACAGAAAAGGAAACTCATTCAGACTGGTGATATC GT GAT GTACCTAAAAGT CAGAAACCACATTTT 
CT C CT CAGAAGT AGGGACCGCTT T CT TAC CT GT TT AAATAAACCAAAGTAT AC CGT GT GAAC CAAACAAT C T CTTTT C 
AAAACAGGGT GCT CCT C CT GGCTT CT GGCTT CCATAAGAAGAAATGGAGAAAAATATAT ATATATAT ATATATATT GT 
GAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 
TAACTGAATTAT TT TTTAAAT TAAGCAGTTCT ACTCAAT CACCAAGAT GCTT CT GAAAATT GCATTTTATT ACCATTT 
CAAACT AT T TT TTAAAAAT AAAT ACAGTTAACATAGAGT GGT TT CTT CATT CAT GT GAAAATTATT AGC CAGCACCAG 
AT GCAT GAGCT AAT TAT CT CT TTGAGTCCTT G CT TCT GT TT GCT CACAGTAAACTCATT GT TTAAAAGCTT C AAGAAC 
AT TCAAGCT GT T GGT GT GT TAAAAAAT GCATT GT ATT GATTT GTACT GGTAGTT TATGAAAT TT AAT TAAAACACAGG 
C CAT GAAT GGAAGGT GGTATT GCACAGCTAATAAAATAT GATTT GT GGAT AT GAA 
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MMMVRRGLIAWI SRVW^ 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEWAGWIATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 

ENS PNHRPYT AS DFI EGI YRTERDKGTLYELT FKGDHKHE FKRL I L FRPFS P IMKVKNEKLNMAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANFRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNL I WRT PVRGL FHLWHEKRCMDELT PEQYKMCMQSKAMNEASHGQLGMLVFRHE I EAHL 




RKQKQKT S SKKT 



Important features: 
Signal peptide: 

amino acids 1-27 




N-glycosylation sites. 

amino acids 315-319, 324-328 



N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 



Amidation site. 



amino acids 377-381 
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GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 

GAAACCCAT GAT ACCC TACT GAACACCGAAT CCCC TGGAAGCCCACAGAGACAGAGACAGCAAGA 

GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 

CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 

CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 

GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 

GGAAACAATGCCCAGTCGCCCATCGATATT CAGACAGACAGTGTGACATT TGACCCTGAT T TGCC 

TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 

ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 

GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 

TGAAGCCACAT TTGCAGAGCTCCACAT TGTACATTATGACT CTGATTCCTATGACAGCT TGAGTG 

AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 

AATATAGC T TAT GAACACAT TC T GAGTCACT TGCATGAAGTCAGGCATAAAGAT CAGAAGACCT C 

AG TGC CT CC CT T CAACCTAAGAGAGC T GC T CCCCAAACAGCTGGGGCAGT AC TTCCGC T ACAATG 

GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 

AT TTCAAT GGAACAGCTGGAAAAGCTTCAGGGGACAT T GT TCTCCACAGAAGAGGAGCCCTCTAA 

GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 

TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 

TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 

CCGAAAGAGTGT GGT CT TCACC T CAGCACAAGCCACGACTGAGGCATAAAT T CC T TCT CAGAT AC 

CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 

CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 

GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 

ATGAGGAAATCGCTGTGTT GT TAATGCAGAGAN CAAAC T CT GT T TAGT T GCAGGGGAAGT TT GGG 

ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 

C T CCT T AGGATAAAGAGT TGC TGT TGAAGT T GTATAT T T TT GATCAATATAT T TGGAAATT AAAG 



TTTCTGACTTT 
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MLFSALLLEVIWIIAMGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 

WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryo tic- type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 



AppJD=10063537 



Page 224 



FIGURE 75 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 

TCTGGTGGTTTGCCTAAACCTGCTy^LCATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 

ATGGACT CCACCAGAGGGTCTT CAAGGAGT T AAAGT TACT TACACT GTGCAGT AT TTCATCACAA 

AT T GGCCCACCAGAGGT GGCACT GACTACAGATGAGAAGT CCAT T T C TGT T GT CCTGACAGC T C C 

AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCTGA 

AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 

ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 

AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 

CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 

TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 

GAT T T TGATT TATGGAAAT GAAT TT GACAAAAGAT T C T T T GT GCCT GCT GAAAAAAT CGT GAT T A 

AC T T TAT C AC C C T C AAT AT C T C G GAT GAT T C T AAAAT T T C T CAT C AG GAT AT GAG T T T AC T GGG A 

AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 

GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 

AAAACAC G GAAGG T AC T T C T C T C AC C C AG CAAGAG T C C C T C AGCAGAAC AAT AC C CC C GGAT AAA 

ACAGTCATTGAATATGAATATGATGTCAGAACCACTGACATTTGTGCGGGGCCTGAAGAGCAGGA 

GCTCAGTTTGCAGGAGGAGGTGTCCACACAAGGAACATTATTGGAGTCGCAGGCAGCGTTGGCAG 

TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 

CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 

TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 

AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 

CCAGACAGGCCACCAGGAGAAAATGAAACCTATCTCATGCAATTCATGGAGGAATGGGGGTTATA 

T GT GCAGATGGAAAACTGATGCCAACACTTCCTT TTGCCT TT TGTTT CCTGTGCAAACAAGTGAG 

TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 

CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 

GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 

AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 



FIGURE 76 



MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYOTSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKAKI I FW YVLP I S I TVFL FSVMGYS I YRYIHVGKEKHPANL I LI YGNEFDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-2 8 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAA7VAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
C CAAT C T AGT GAAAAACCAGCT GT G T C C C GT GAT CGAGGCT T CCT T C AAT GGCAT GTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGGCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
TGATCAATGAAAAGGCTGCAGATAAGCTGGGATCTACCCAGATCGTGAAGATCCTAACTCAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 

T T T ACACCAAAGGT GACC AACT T AT AC TC AAC T TGAATAACAT C AGCT CT GATCGGAT CCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCGAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 

CCTGTGAAAAA 



FIGURE 78 



MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 

PAGG I PVLGSLVNT VLKH 1 1 WLKVI TANI LQLQVKP SANDQELLVKI PLDMVAGFNT PLVKT I VE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL 
VKNQLCP V IEAS FNGMYADLLQLVKVP I SLS I DRLE FDLLYPAIKGDT I QLYLGAKLLDS QGKVT 
KW FNNS AAS LTMPT LDN I P FS L I VS QDVVKAAVAAVLS PEE FMVLLDS VLPE S AHRLKS S I GL IN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 

KGDQL I LNLNNI S SDRI QLMNSGI GWFQPDVLKNI I TE 1 1 HS I LLPNQNGKLRS GVP VSLVKALG 
FEAAESSLTKDALVLTPASLWKPSSPVSQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 



GAGAGAAGT CAGCC T GGCAGAGAGACTC T GAAAT GAGGGAT TAGAGGTGT T CAAGGAGCAAGAGC 
TT CAGC CT GAAGACAAGGGAGCAGT CCCTGAAGACGCT T C TAC TGAGAGGTCT GCCATGGCCT CT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCT CTGGATGGAATGTGCCACACACAGCACAGGCAT CACCCAGT GTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 

CAAGCCCAACC T C T T GCCACAAGGAGCTC TCCAAGGCC T GG TCAACC T CCCAAAGTCAAGAG TGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGT CT G T GAAAAACAGT GGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCAC TGGAT CGT 
GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAATTTGTGCATGAACTGAAATAAAACCATCCTACGGTATCCAGGGAACAGAAAGCAGGATG 
CAGGAT GGGAGGACAGGAAGGCAGCC T GGGACAT T TAAAAAAAT A 



FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 

I YSTLLGLPADIQAAQAMMVTSSAISSLACI I SWGMRCTVFCQESRAKDRVAVAGGVFFILGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 

DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains : 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site, 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
W* CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 

Kl ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 

p! 

5! GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 

I'll 

jlj TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 

i|! AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 

III ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 

*** ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 

p Ta GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 

|JI AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 

|3 CAGACCCAAGGGAGCAGGCAGCACAACT GT CAAGAT CGTCCT GAAGGAGAAACATAAGAAAGCCT 
HI 



El 



GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACAT AACAAAGACC TAACAGTT GCAGAT AT GAGCTGTATAAT T GT T GT TAT TAT AT AT TAATAAA 
TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 

MVPEVRVLSSLLGLALLWFPLDSHARARP 

GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 82 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 

GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 

TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 

TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 

GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 

CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 

AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 

GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 

ACAGGTGTCAGCACTGGGCTCAGTTCCTCTCATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTAC 

TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 

AC AGACTCC AGG AC AAAC AG AGAC AT GC AT GGCCT GTT T G ATGT GGAG AT CT CTCT GACCGT CCAAGAGAA 

CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 

GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 

TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 

GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 

AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 

CCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTA 

CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 

AGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTC 

ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 

T GAGTGT GGG ACCAT CTCCTT C TT C AAC AT AAAT GACCAGTCCCT T AT TT AT AC CCT G AC AT GTCGGT TT G 
AAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC 

CC AGTCACCC AGGAAT C AGAGAAAGAGGCCT CTT GGCAAAGG GCCT CT GC AATC CCAG AG ACAAGC AACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACAT 
T CTT CTT T AGGG AT AT T AAGGT CT CTCT CC CAGATC CAAAGT CCCGCAGCAGCC GGCC AAGGT GGCTT CC A 
GATGAAGGGGGACT GGCCT GTCCACATGGG AGTC AGGTGT CATGGCTGCCCT GAGCTGGG AGGG AAG AAGG 
CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 
AACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGCTTAG 
AT CT T ATT GATG ACAGAGTGT ATCCT AATG GTTT GT TC AT TAT AT T ACACTT TC AGT AAAAAAA 



FIGURE 84 

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSWH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWAS QS FQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLS PDHG YWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
P I VI CPVT QE SEKEAS WQRASAI PET SNSE S S SQAT TP FLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8* 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCT C T GGGGGAGGGAGAGGGCGGAAGGACAGACAAG TAAACTGCT GACGAT GCAGAGT TCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 
TACTTCTTTCGTATGGAGAAAGGAAGTATAAAATGGAATTATAAACATCACCGGCTCTCTGTGAA 
Pi TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
HI AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
U] GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
^ GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
p ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
\| GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
» GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
Cl GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
51! CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 

i-l 

ill GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 

f It 

f J CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
fit TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 

AGCCTCTCAGGGGCCGCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGC CT TGGGAC T CGCGGGGACAGGAGGCCACT GACACCGAGT ACT CGGAGATCAAGATCCACAG 
ATGAGAAACT GCAGAGAC TCACCC T GATT GAGGGATCACAGCCCC TC CAGGCAAGGGAGAAGT CA 
GAGGCTGAT T C T TGTAGAAT TAACAGC CC T CAACGTGATGAGCTATGAT AACAC TATGAATTATG 
TGCAGAGTGAAAAGCACACAGGCTT TAGAGT CAAAGTATCTCAAACCTGAATCCACAC TGTGCCC 
TC CCT T T TATT T T TT TAAC TAAAAGACAGACAAAT T CC TA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANT DQDAPVATNNPARAVWEE TRDRFHLLGDPHTKNCT LS I RDARRS DAGRY FFRMEKGS I KWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVGAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVVVRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 

YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 

AGAAAGC T GCACT C TGT TGAGCT CCAGGGCGCAGTGGAGGGAGGGAGT GAAGGAGC T CT C T GTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAA.TGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAAC TACAACACCT T TGGATCTGCAGAGGCGGCCACGAGCGAT GAC TACAAGAACCCTGGC TACT 
Kl ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
p AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
lit GT TT GGCATCTACCAGAAATAT CCAGT GAAATATGGAGAAGGAAAG T GT T GGAC TGACAACGGCC 

h| CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
S| GGCCAGC GGGAATTCACT GCGGGAT T TGT T CAGT TCAGGGTAT T T AATAACGAGAGAGCAGCCAA 

* CGC CTTGTGTGC TGGAAT GAGGG T CACCGGAT GT AACAC TGAGCAT CACT GCAT TGGT GGAGGAG 

^ GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 
K GGAAC T CAT GT TGGT T ACAGCAGCAGC CGT GAGATAAC TGAGGC AGC T G T GCT T C TAT T C T AT CG 

fj| T TGAGAGT T T T GTGGGAGGGAACCCAGACC TC TCCT CCCAACCATGAGATCCCAAGGATGGAGAA 

O CAAC TT ACCCAGTAGCT AGAAT G T TAAT GGCAGAAGAGAAAACAATAAATCATAT TGAC T CAAGA 

W AAAAAA 



App_ID=10063537 



Page 237 of 3 18 



FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWT DNGPVI PWYDFGDAQKTAS YYS PYGQRE FTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 



ill 

ill 



N-glycosylation site. 

amino acids 163-167 




Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 



N-myristoylation sites. 

amino. acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8Q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T TAAGAC TTATATACAGTT TTAGGGGACAAT TAAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNS TQS L I S I GAHCSAS VALS FFI FERWECTT YWYI FVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE Qi 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATCCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
AGTT CCACCAGGAACAGCAGAT C T TCAT CGAAGCCAAGGCC TCCAAAAT C T TCAACTGCCGGAT G 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 

AGAAAGGGT CCCAAGT GC TGGT CCC AACC T GAAGC T GT GGAGT GACT AGAT CACAGGAGCAC TGG 
AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 



0 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEJ^SKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 



amino acids 1-14 



N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 

CGGTGGCC ATGA CT6CGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 

ctttatgtcttcaccatcgccatcgagccgttgcgtAtcatcttcctcatcgccggagctttctt 
ctggttggtgtctctactgatttcgtcccttgtttggttcatggcaagagtcattattgacaaca 
aagatggaccaacacagaaatatctgctgatctttggagcgtttgtctctgtctatatccaagaa 
atgttccgatttgcatattataaactcttaaaaaaagccagtgaaggtttgaagagtataaaccc 
aggtgagacagcaccctctatgcgactgctggcctatgtttctggcttgggctttggaatcatga 
gtggagtattttcctttgtgaataccctatctgactccttggggccaggcacagtgggcattcat 
ggagattctcctcaattcttcctttattcagctttcatgacgctggtcattatcttgctgcatgt 
attctggggcattgtattttttgatggctgtgagaagaaaaagtggggcatcctccttatcgttc 
tcctgacccacctgctggtgtcagcccagaccttcataagttcttattatggaataaacctggcg 
tcagcatttataatcctggtgctcatgggcacctgggcattcttagctgcgggaggcagctgccg 
aagcctgaaactctgcctgctctgccaagacaagaactttcttctttacaaccagcgctccagat 
aacctcagggaagcagcacttcccaaaccgcagactacatctttagaggaagcacaactgtgcct 

TT TTCTGAAAATCCCT TT TTCT GGTGGAAT TGAGAAAGAAATAAAACTATGCAGATA 
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FIGURE Q4 

MTAAVFFGCAFIAFGPALALYVFT IAIEPLRI I FL I AGAFFWLVS LLI S S LVWFMARVI I DNKDG 
PTQKYLLI FGAFVSVYIQEMFRFAYYKLLKKASEGLKS INPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 95 



AAT T T T T CACCAGAGTAAACT T GAGAAACCAACT GGACC T TGAG TAT T GTACAT TT TGCC TCGTG 
GAC CCAAAGGTAGCAATCTGAAAC ATGA GGAGTACGATTCTAC TG T T T T G TC TTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACAC TAC CAAAC CAACAGCAG T CAAAT CAGG T CTTTCCTTCTT T AAG T C T GATACCAT T AACA 
CAGAT GCT CACAC TGGGGCCAGATC T GCAT CTGT TAAAT CC TGCT GCAGGAAT GACAC C T GGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 
M CAATT TT TGTCACACAACT TGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC 

M TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 

pi 

p TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
y GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT TG AT TATAT TATGGAATAGAT T GAGACACAT T GGATAGT C T T AGAAGAAAT T AAT TC T T AAT T 
TACCTGAAAATATTCTTGAAATTTCAGAAAATATGTTCTATGTAGAGAATCCCAACTTTTAAAAA 
Iff CAATAATTCAATGGATAAATCTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGCATAT 
13 TAAAACATATT TGGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



III 



a 



t .2 



AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Q6 



hi 

yf 



en 



MRSTI LLFCLLGS TRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSL I PL TQM 
LTLGPDLHLLNPT^AGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
T DD D FAVT T PAG I QRS THAI E E AT TE SANG I Q 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTrGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGC TACAGGTCACCCTGGAGATGCAGGATGGACATGT CT TGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
► , AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
r* GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
& CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
Q ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
||1 CCAGGCCTCAGGCCACCAGGCCACT GCCACCGT GGAAGT CT CCAT CAT AGAGAGCACC TGGGTGT 

P* CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC 
||| AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
%4 ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
L ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
W- CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
III CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
P TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
fl\ ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
jjf CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
M GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
HI GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT TT TCAC CCAC TGGACCATGT CAAGGAAGAAGGACCCGGAT CAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPG I P FL FLEAS DRDE PGTANS DLRFHI L SQAPAQPS PDMFQLE PRLGALALS PKG 
S T S LDHALERT YQLLVQVKDMGDQAS GHQATATVEVS 1 1 E S TWVS LEP I HLAENLKVLYPHHMAQ 
VHWS GGDVHYHLE S HPPGP FEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHE WVWQS VAKLVGPGPGPGATATVTVLVERVMP PPKLDQE S YEASVP I SAPAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHI I PVWS HNAQMWQLLVRV I VCRCNVEGQCMRKVGRMKGMP TKL S AVG I LVGTL VAI G I FLI 
L I FTHWTMSRKKDPDQPADS VPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE QQ 

GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 

AGT CCAAGATTCT TCCCAGGAACACAAACGTAGGAGACCCACGCT CCTGGAAGCACCAGCCTTTA 

TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 

AT C T AGGAC CCAGGCATCT TGC T T T CCAGCCACAAAGAGACAGATGAAGAT GCAGAAAGGAAAT G 

TTCTCCTTATGTTTGGTCTACTATTGCATTTAGAAGCTGCAACAAATTCCAATGAGACTAGCACC 

TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 

TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 

GCAT AGT CACCAAC T C TGAGT T CCATACAACC TCCAG TGGGAT CAGCACAGCCACCAACT CT GAG 

T T CAGCACAGCGT CCAG T GGGATCAGCAT AGCCACCAAC TCTGAGT CCAGCACAACCT CCAGTGG 

GGCCAGCACAGCCAC CAAC TCT GAGT CCAGCACACCCTCCAGT GGGGCCAGCACAGT CACCAAC T 

CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 

AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 

CAACT C T GAC T CCAGCACAACCTCCAGT GGGGC TAGCACAGCCACCAACT C TGAGTCCAGCACAA 

CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 

GCCACCAACT CTGAGTC CAGCACAAC CT CCAGT GGGGCCAGCACAGC CACCAACT CT GAGTC CAG 

AACGACCTCCAATGGGGCTGGCACAGCCACCAAC TCT GAGT CCAGCACGACCTCCAGTGGGGCCA 

GCACAGC CACCAACTCT GAC TC CAGCACAGTGTCCAGTGGGGC CAGCACT GCCACCAAC TCT GAG 

TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 

GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 

CTGAGT CCAGCACAGTGT CCAGT GGGATCAGCACAGT CACCAAT TCT GAG T CCAGCACACCC TCC 

AGT GGGGCCAACACAGCCACCAACT C T GAG T CCAGTACGAC CT CCAGT GGGGCCAACACAGCCAC 

CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 

CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 

GC CACCAAC TC T GAC TCCAGCACAAC CT CCAGTGAGGCCAGCACAGCCAC CAAC TC T GAGTC TAG 

CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 

ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 

ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 

GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 

TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 

CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 

GTGGAGTCCTAACTGGTTCTGGAGGAGACCAGTATCATCGATAGCCATGGAGATGAGCGGGAGGA 

ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 

CCCAAGACC T GGT TTCC T T T CAT TCAT CCCAGGAGACCCC TCCCAGC T T T G T T T GAGAT CC T GAA 

AAT CT T GAAGAAGGTAT T CC TCACCT T TCT T GCC T T TACCAGACACT GGAAAGAGAATACTATAT 

T GCTCAT TTAGCTAAGAAATAAATACATCTCATC TAACACACACGACAAAGAGAAGC TGTGCT TG 

CCCCGGGGT GGGTATC TAGC TC T GAGAT GAACTCAGT TATAGGAGAAAACC T CCATGC T GGAC TC 

CATCTGGCATTCAAAATCTCCACAGTAAAATCCAAAGACCTCAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



U! 



FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNET S T SANTGS S VI S S GAS TATNSGS SVTS S GVS TAT I SGS 
S VT SNGVS I VTNS EFHTTSSGIS T ATNSE FS T AS S G I S I ATNSE S S TT S S GAS TATNS E S S TP S S 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
p STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
P AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 

m 



AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 



fH amino acids 510-532 

6 

£11 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCT GGAAC TATCAGGC T GAAAAACAGAGTGGGTAC TC T CT TCTGGGAAGCTGGCAACAAAT 
GGAT GATGTGATATATGCAT TCCAGGGGAAGGGAAAT TGTGGTGCT TCTGAACCCATGGTCAAT T 
AACGAGGCAGTTTCTAGCTACTGCACGTACTTCATAAAGCAGGACTCTAAAAGCTTTGGAATCAT 

GG T GT CATGGAAAGGGAT TTACTT TATACT GACTC T GT T T TGGGGAAGC T T T T T TGGAAGCAT T T 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 

GAT TAT AACT GGGGAT GCAT T T GT T CCT GGAGAAAGAAGT GT CAT TAT CAT GAACCAT CGGACAA 
GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
CTTCAT TCATAGGAAATGGAAGGATGACAAGAGCCATT TCGAAGACAT GAT TGATTAC TT TT GT G 
ATATTCACGAACCACTTCAACTCCTCATATTCCCAGAAGGGACTGATCTCACAGAAAACAGCAAG 
TCTCGAAGTAATGCATTTGCTGAAAAAAATGGACTTCAGAAATATGAATATGTTTTACATCCAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 

CCT TCAAC T CTGGT GC CACAAACGGT GGGAAGAGAAAGAAGAGAGGCT GCG T TCC T TC TATCAAG 
GGGAGAAGAATTTTTATTTTACCGGACAGAGTGTCATTCCACCTTGCAAGTCTGAACTCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATT TGTACAGTCT TGT TAAGT GGTATT TTATAATCACCATTGTAAT CTTT GTGCTGCAAGAGA 
GAATATTTGGTGGACTGGAGATCATAGAACTTGCATGTTACCGACTTTTACACAAACAGCCACAT 
T T AAAT T CAAAGAAAAAT GAGTAAGAT T AT AAGGT T T GCCATGTGAAAACCTAGAGCAT AT T T T G 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCAT CAT TAT T T GT T AAAGAT AT T T T GCACT TAAT T T TGTGGGAAAAATAT TGC TACAAT T T T T 
T T TAAT CT C T GAATGTAAT T TCGATACTGT GT ACATAGCAGGGAGTGAT CGGGGT GAAATAAC TT 
GGGCCAGAATATTAT TAAACAAT CATCAGGCTT TTAAA 



FIGURE 102 



MHSRGRE I WLLNPWS INEAVS SYCTYFIKQDSKS FGIMVSWKGI YFILTLFWGS FFGS I FMLS P 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGW^^ 

LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLVWLLSILYWTLFSPMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LE I IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 103 

CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 

T CCAAAT CAT CCAT CCAC CCC TGCTGT CAT CT GT T T TCATAGT GT GAGAT CAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CAC CT CT ACAGAGAT GGGGAAGAC T GGGAAT C TAAGCAGATGCCACAGT AT CGAGGGAGAAC TGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGAT T T GT CT TCAGAC T CCAGAGCAAATGCAGATGGGTACAGCCT GT AT GAT GT GGAGAT C T CC 
AT T AT AGT CCAGGAAAAT GCTGGGAGCATAT T GTGT TCCATCCACCT TGCT GAGCAGAGTCAT GA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
AAAT CCAAAGGGAAAAT C CAGGCGGAACT GGAC TGGAGAAGAAAGCACGGAC AGGCAGAAT T GAG 
AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
T TT CT GAT CT GAAAACT GT AACCCATAGAA&AGCT CCCCAGGAGGT GCC T CACT C TGAGAAGAGA 
TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
T GACAT GT CAG TT TGAAGGC T T GT T GAGAC CC TATATCCAGCAT GCGAT G T ATGACGAGGAAAAG 
GGGAC TCCCATAT T CATATGTC CAGT GT CC T GGGGATGAGACAGAGAAGACCCTGC T TAAAGGGC 
CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 

TCAGAAGCCAT GGC T GCCC T GAAGT GGGGACGGAATAGAC T CACAT TAG G T T T AGT T TGT GAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGT GATT CCTGCCTCACAGGTGAAGATTAAAGAGACAACGAATGT GAATCATGC TTGCAGGTT 
TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
TGCTTATTCCACATTAATTTACTTTTCTCTATACCAAATCACCCATGGAATAGTTATTGAACACC 
TGCT T T GT GAGGC TCAAAGAATAAAGAGGAGGTAGGAT TT T TCACT GAT TCTATAAGCCCAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
CTCAT TAACACAGACACAAAAATTCTAAA.TAAAATTTTAACAAATTAAACTAAACAATATAT T TA 
AAGAT GAT AT ATAAC TACT CAGT GTGGT T T GTCCCACAAAT GCAGAGT TGGT T TAATAT T TAAAT 
AT CAACCAGT GTAAT T CAGCACAT T AATAAAGTAAAAAAGAAAACCAT AAAAAAAAAAAAAAA 



FIGURE 1Q4 

MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAVVH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
I VQENAGS I LCS I HLAEQSHE VES KVL I GET FFQPS P WRLAS I LLGLLCGALCGWMGMI I VFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNWTLSPNNGYWVLRLTTEHLYFT 
FNPHFI SLPPS TP PTRVGVFLDYEGGT I S FFNTNDQSL IYTLLTCQFEGLLRPY IQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE 1Q5 



C C TT CACAGGAC TCT T CAT T GCTGGT TGGC AATG AT GT AT C GGCCAGAT GTGGTGAGGGCT AGG AAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCACTGTTCATTATGTGAGATATAATCAAAAGAAGACCTACAATTACTATAGCACATTGTCATTTACAAC 
TGACAAACTATATGCTGAGTTTGGCAGAGAGGCTTCTAACAATTTTACAGAAATGAGCCAGAGACTTGAAT 
CAATGGTGAAAAATGCATTTTATAAATCTCCATTAAGGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTC 
AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
ATCCTCACTCAGTTAAAATTAAAAAAATCAACAAGACAGAAACAGACAGCTATCTAAACCATTGCTGCGGA 
AC ACG AAG AAGT AAAACT CT AGGT CAGAGT CT C AGG AT CGTT GGTGGGAC AG AAGT AGAAGAGGGTGAAT G 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
AT AAAACCT T CGAAAATG AAACGGGGTCTCCGGAGAAT AAT T GT CC AT GAAAAATACAAACACCC ATC AC A 
TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 
TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 

GATGGT T AC AGT CAAAAT CATCTT CGAC AAGC ACAGGTGACT CT CAT AGACGCT AC AACT TGCAATGAACC 
TCAAGCTTACAATGACGCCATAACTCCTAGAATGTTATGTGCTGGCTCCTTAGAAGGAAAAACAGATGCAT 
GCCAGGGTGACTCTGGAGGACCACTGGTTAGTTCAGATGCTAGAGATATCTGGTACCTTGCTGGAATAGTG 
AGCTGGGGAGATGAATGTGCGAAACCCAACAAGCCTGGTGTTTATACTAGAGTTACGGCCTTGCGGGACTG 
GATTACTTCAAAAACTGGTATCJAAGAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 
GGTGTGGAGGCCATTTTTAGAGATACAGAATTGGAGAAGACTTGCAAAACAGCTAGATTTGACTGATCTCA 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATCAACTCTGTCATCTGTGAGCAATAGTTGAAACTTTATGTACATAGAGAAATAGATAATACAATATTAC 

AT T AC AGCCT GT AT T C AT TTGT TCT CTAGAAGT T TT GTCAGAATTT TGACTT GT TG AC AT AAAT TT GT AAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TCAAGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 
AAGT AT T AGGTGTT TT TCTT AGT G GAAT AT T AGAAATGAT CAT ATT CATT AT GAAAGGTC AAGC AAAGAC A 
GCAGAATACC AATCAC TTC ATC AT T T AGGAAGT ATGGGAACT AAGT TAAGGAAGTCCAGAAAG AAGCC AAG 
ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
ATAATAATTATACAAACTTCATGCAATGTACTTGTTCTAAGCAAATTAAAGCAAATATTTATTTAACATTG 
T T ACT GAGGATGTC AAC AT AT AAC AAT AAAAT AT AAATCACC C A 
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FIGURE 106 



MMYRPDWRARKRVCWE PWV I GLVI FI S L I VLAVC I GL TVHYVRYNQKKT YNY YS T L S FT TDKL Y 
AE FGREASNNFTEMS QRLE SMVKNAFYKSPLREE FVKS QVI KFS QQKHGVLAHMLL I CRFHS TE D 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRI I VHEKYKHPS HDYDI S LAE LS S PVPYTNAVHRVCLPDAS YE FQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 107 



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
AT AATGT T CT CAC T T TAACTGCAGAAACGAGGGTAGAGGAAGCAGT CATT T T GACT TACT TT CC T 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GAT AT GGT CACT T T GAAAGCCAGGAT GACAAAT TAT GGAT TACCT AGAT AT CGGT GGCT TAC TCA 
T GCT TGGAAT T T T T T TCAGAGAGAGT T TAAGTGCT GT GGAGTAGT ATATT T CAC T GAC TGGTTGG 
AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 
CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 
T T T GAGAGGAACCAAACAACTGCAGGTGCT GAGGT T T C T GGGAATCT CCAT TGGGGT GACACAAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAAT GATGTCC T TGAAGAATGACAAC TC T CAGCACCT GTCAT GT CCC T CAGTAGAAC T 
GTTGAAACCAAGCCTGTCAAGAATCTTTGAACACACATCCATGGCAAACAGCTTTAATACACACT 
TTGAGATGGAGGAGT TATAAAAAGAAATGTCACAGAAGAAAACCACAAACTTGT TTTAT TGGACT 
TGTGAAT T T T T GAGTACATAC TATGTGTT TCAGAAATAT GTAGAAATAAAAATGT TGCCATAAAA 
TAACACC TAAGCATATAC TAT T CTAT GCT T T AAAATGAGGATGGAAAAGT T T CATGTCATAAGTC 
ACCACCT GGACAATAATTGATGCCCT TAAAATGC TGAAGACAGATGTCATACCCACTGTGTAGCC 
T GT GT AT GACT T TTAC TGAACACAGT TATGTT TTGAGGCAGCAT GGT TTGATTAGCATTTCCGCA 
T CCAT GCAAAC GAG TCACATAT GGT GGGAC TGGAGCCATAGTAAAGGT T GAT T TACT TC TACCAA 
CTAGTATATAAAGTACTAATTAAATGCTAACATAGGAAGT TAGAAAATACTAATAACTT TTATTA 
CTCAGCGATCTATTCTTCTGATGCTAAATAAATTATATATCAGAAAACTTTCAATATTGGTGACT 
ACCTAAAT GTGATTT TTGCTGGT TACTAAAATATTCT TACCACTTAAAAGAGCAAGCTAACACAT 
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
T T CAGT T CT GATAAT GT TAAGAATAACCAT TAT GAAAAGGAAAAT T TGT CC T GTATAGCAT CAT T 
ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGT TAT T TAAAT AC T TAAC CAC TAAT T T T GAAAAT TAC CAG TGT GAT AC AT AGGAAT CAT TAT T C 
AGAAT GTAGTCTGGTC TT TAGGAAGTATTAATAAGAAAAT TTGCACATAACTTAGTTGATTCAGA 
AAGGACT T GT AT GC TGTT T T TC TCCCAAAT GAAGAC TC TT T TT GACAC T AAACAC T T T T TAAAAA 
GC TTATCT T TGCC T TC TCCAAACAAGAAGCAATAGT C T CCAAGT CAATAT AAAT T CTACAGAAAA 
TAGT GT TC T T T T TCT CCAGAAAAATGCT T GT GAGAATCAT TAAAACAT GT GACAAT T TAGAGATT 
CTTTGTTT TAT T T CACTGAT TAATATACT GT GGCAAATTACACAGAT TAT TAAAT T TT TT TACAA 
GAGTATAGTATATTTATTTGAAATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTAT 
T TCTCAGAATATGGAAAGAAAAT TAAAATGTGTCAATAAATATT TTCTAGAGAGTAA 



FIGURE 108 



MAREDSWCLRCLLYALNLLFW1MSISVLAVSAWMRDYLNNVLTLTAETRVEEAVILTYFPWHP 
VMIAVCCFLIIVGMLGYCGTVKRNLLLLAWYFGSLLVIFCVElACGWTYEQEmVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQRE FKCCG WYFTDWLEMTEMDWPPDS CCVRE FPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 



If! amino acids 12-35, 57-86, 94-114, 226-248 

III . 



U 
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FIGURE 1QQ 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 

GAT AAAGT CT T TCT C GT GAGAGC C T AG AGGC C T T AAAAAAAAAAGT GC T T GAAAGAGAAG GGGAC AAAGGAAC A 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCTGGAGGAACACTGACCACCAGTTGGATGAGTCTCAAGGTCCTCCTCTATGTGACAACCATGTGAATGGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACT GCTGT CTCT GGAACACCAC GGT GGAAGTC AAGGCT T GCCCTGGAGGCT ACT ATGT GTATC GT 
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
CAGCTGCTCAGATACCAGCGAGTGCACATGCGCTCCAGGAACTGTGCTAGGCCCTGACAGGCAGACATGCTTTG 
IP ATGAAAATGAAT GT GAGCAAAACAACGGTGGCT GCAGT GAGATCT GT GT GAACCTCAAAAACTCCTACCGCT GT 

III GAGTGT GGGGT T GGC C GT GT GCT AAGAAGT GAT GGC AAG ACT T GT GAAGAC GTT GAAGGAT GCC AC AAT AAC AA 

II I TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
Hi CTGAGGATAACCACACTTGCCAAGTCCCTGTGTTGTGCAAATCAAATGCCATTGAAGTGAACATCCCCAGGGAG 
1:5 CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
* CCTCTTCTCTCT C AAGACAT GT GGT AC AGT G GT C G ATGT GGT G AAT G ACAAGAT T GT GGC C AGC AACC T C GT GA 

|j{ CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
J:!! ACG T GC GAGTT T CC ACGCC T GT ACAC CAT T T CT GAAGGAT AC GT T CC CAACC TTC GAAAC T CCC C ACT GGAAAT 

1" = CAT GAGCCGAAATCATGGGATCT TCCC ATT C ACTCT GGAGATCTTCAAGGAC AAT GAGTT T GAAGAGCCTT ACC 

f II 

0 GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 

TTGGAAAGCTTGGT GGAGAGCTGCTTT GCCACCCCCACCTCCAAGAT CGACGAGGTCCT GAAAT ACTACCTCAT 
CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
T GGGT T C AGAC T TC AC AC T GT GAGT TC AGAC T C CC AGCAC C AACT CACTCT GATT C T GGT CC AT TC AGT GGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CAC AAAATCAGAAGCT GGGTATAATATTTCAAGTT ACAAACCCTAGAAAAAT TAAACAGT TACT GAAATT AT GA 
CT T AAAT ACCC AAT GAC T CCT T AAAT AT GT AAAT T AT AGT T AT ACCT T GAAAT T T C AATT C AAAT GC AGAC T AA 
T TAT AGGGAATTT GGAAGT GTATCAAT AAAACAGT AT AT AATT TT 
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FIGURE no 



I ■ J 

III 

St 

m 



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPVVHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 



Tyrosine kinase phosphorylation sites. 

C| amino acids 411-418, 443-451 



k! N-myristoylation sites. 

ni 



amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE ill 



G AGAGAGGC AGCAGCTTGCT CAGCGG AC AAGG AT GC TGGGCGTG AGGGAC C AAG GCCTGC CCTGC ACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCT G CT GCCT TGGGGT GACAAT CT C AGCTCCAGGCT AC AGGG AGACCGGGAGGATC AC AGAGCC AGCATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
AT GG AG AC CT TC AGAAAGGT GGGGAT CCCC AT CAT CAT AGC ACT ACTG AG CCTGGCGAGT AT CATC AT TGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGC AGCT GT GT GACGGAGAGC T G GACT GT CCCT TGGGGG AGGACGAGGAGC ACTGT G TC AAGAGC TT C 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
GCT AC AGC AGAGCT GT GG AG AT TGGCCC AG ACC AGG AT CT GG AT GT TGT T GAAATC ACAGAAAAC AGC CAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
T C AGCAT CC AGT ACG ACAAACAGC ACGT CT GTGG AGGG AGC AT CCT GG ACCCCC ACTGGGTCCTC ACGGC A 
GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG 
CTTCCC AT CC CT GGCT GT GGCCAAGAT CAT CAT C AT TGAATT C AAC CCCATGT ACC CC AAAG AC AATG ACA 
P TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
111 GATGAGGAGC TC ACT CCAGCCACCCC ACTC TGGATC AT T GGATGGGGC TTTACGAAGC AGAAT G GAGGGAA 

Si GATGTCTGACATACTGCTGCAGGCGTCAGTCCAGGTCATTGACAGCACACGGTGCAATGCAGACGATGCGT 

ACC AGGGGGAAGTC ACCGAGAAGAT GAT GT GTGCAGGC AT CCCGGAAGGGGGT GTGGACACCT GCC AGGGT 
GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 
GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGAT CCCC CAAAGT CAGACACAGAGC AAGAGT CC CCTT GGGT AC ACCCCT CT GCCC AC AGCC TC AGC AT 
TTCTTGGAGC AGCAAAGGGCCT C AAT TCCT GTAAGAGACCCT CGC AGC CC AGAGGCGCCC AGAGGAAGT C A 
GC AG CC CT AGCT CGGCCACACT T GGT GCTCC C AGCAT CC C AGGGAGAG AC AC AG CCCACT GAAC AAGGTC T 
CAGGGGTATTGCTAAGCCAAGAAGGAACTTTCCCACACTACTGAATGGAAGCAGGCTGTCTTGTAAAAGCC 
CAGAT C ACT GTGGGCT GG AG AGGAGAAGGAAAGG GT CTGC GC CAGCCC TGT CCGTCTT CACCCATCCC C AA 
GC CT AC TAGAGC AAGAAACC AGTTGT AAT AT AAAAT GC ACTGCCCT ACT GTT GGT ATGACT ACC GT T ACC T 
ACT GT T GT C ATT GT T ATT AC AGCT AT GGCCACT ATT ATT AAAGAGCTGT GT AAC ATC T CT GGC AAAAAAAA 
AAAA 
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FIGURE 112 

MLQDPDSDQPLNSLDVKPLRKPRI PMET FRKVGI P 1 1 IALLSLAS 1 1 IVWLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDVVEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KT PRWGGEE AS VDS WPWQVS I QYDKQHVCGG S I LDPHWVLTAAHC FRKHT DVFNWKVRAGS DKL 
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE 11a 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
T GAACT GCTAGGAT TC TGAC TATGCT GTGGT GGCTAGTGCTCCTACT CC TACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
M GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
0 CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
||1 ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
III CAGAAGGGGTC TGCTTAGACCACCT GGTT TATGTGACAGGACT TGCATTCTCCTGGAACATGAGG 

GAACGCCGGAGGAAAGCAAAGT GGCAGGGAAGGAAC TTGT GCCAAAT TAT GGGT CAGAAAAGAT G 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 



m 

m 

k 



P 

||| CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 

§1* 

M 



CCACGT ACCAGC TGC TGAAGGGCAACT GCAGGCCGAT GC T CTCAT CAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAAT GATC TCCAAGGGCCCTTATACCCCAGGAGAC T T TGATTT GAAT TT GAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
T T TGGGAGGCCGAGGCGGGTAGAT CACCT GAGGT CAGGAGT TCAAGACCAGCCTGGCCAACAT GG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
C CAGC TACT CGGGAGGCTGAGACAGGAGAAT TAC TT GAACCTGGGAGGTGAAGGAGGCT GAGACA 
GG AGAAT C AC T T C AGC C T GAGC AAC AC AG C G AGAC T C T G T C T C AG AAAAAAT AAAAAAAGAAT T A 
TGGTTATTTGTAA 
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FIGURE 114 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPIAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE lis 

CAGCAGT GGT CT CT CAGT CCTCT CAAAGCAAlGGA?\AGAGTACT GT GT GCT GAGAGACCATGGCAA. 
AGAATCC T CCAGAGAAT TGTGAAGACT GT CACATT C TAAATGCAGAAGC T TT TAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
T CTACAGCAATGGAGAGAAGAAGAAGATT TACATGGAAATTGATCCT GTGACCAGAACTGAAATA 
TTCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACTTTAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
C T GAACCAGAAGAGGAAAT AGAT GAGAAT GAAGAAAT T AC C AC AAC T T T C T T T GAAC AG T C AG T G 
AT T T GGGT CCCAGCAGAAAAGC CT AT T GAAAACCGAGAT T T T CT T AAAAAT T CC AAAAT TC TGGA 
GAT T T GT GAT AACGT GACCATG TAT T GG AT CAATCCCACTCTAATAT CAGT T TC TGAG T TACAAG 
ACT TT GAGGAGGAGGGAGAAGATCT T CACT TTCCTGCCAACGAAAAAAAAGGGAT TGAACAAAAT 
GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
AGAAC T T C C AAT AAAT G AC TAT AC T GAAAAT GGAAT AGAAT T T GAT C C CAT G C T GGAT GAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
T TAT CACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
E FSEPEEE I DENEE I TTT FFEQS VI WVPAEKP IENRDFLKNSKI LE ICDNVTMYWINPTL I SVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 
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GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
T ATT T C ACC ATCCT GGGACT TCC AGCC AT GCT GCAGGC AGTGCG AGCCCT GATGAT CGTAGGCATCGT CC T 
GGGT GC CAT T GGCCT CCT GGTATCCATCTT TGCCCT GAAAT GCATCCGC ATTGGC AGC AT GGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
CT TCC AAGCACGACT ATGTG TAA TGC TC T AAGAC CT CT C AGC ACGGGCGGAAGAAACT CCCGGAGAGC TC A 
*..f CCCAAAAAAC AAGGAGATCCCATCT AGATTTCTTCTTGCTTT TGACTCACAGCT GGAAGT TAGAAAAGCCT 

g CGATTT CAT CTT TGGAGAGGCCAAAT GGT CT T AGCC TC AGT CTCTGT CTC TAAATAT TCCACCATAAAAC A 

|j| GCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTATTTCTTTTTTTAAATATAACT 
III T T CT ACT CTG AT GAGAGAAT GT GG TT TT AATCT C T C TCT C AC AT TT TG AT GATT TAGAC AGACTCCCC CT C 

CI TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 
W GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 

M. AC AC TT ACT G AAGAAGAAGC AAT AAGAG AAAG AT AT TTGT AAT CTCT C C AGCCC AT GAT CTCGGT TTT CTT 

fl I 

ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACAT CTT CTT ATT AC AGCAAC ACCATT CT AGGAGTTT CCT G AGCT CT CC AC TGGAGT CC TCTT TC TGT 
CGCGGGT CAG AAATT GTCCCT AGAT G AATGAGAAAAT TAT TT T T TT T AAT TT AAGT CCT AAAT ATAGT T AA 
AATAAAT AAT GT TTT AGT AAAATGAT AC AC TAT C TC T GT GAAAT AGCCTC ACCCCT ACAT GT GG AT AG AAG 
GAAATGAAAAAATAATTGCTTT GACATTGTCTATATGGT ACTTTGT AAAGTCAT GCTTAAGT AC AAATTCC 
AT GAAAAGCT C ACACCTGT AAT CCT AGC ACT TT GGGAGGCTG AGGAGGAAGGAT CACT T G AGCC C AGAAGT 
TC GAGACT AGCCT GGGCAAC AT GGAGAAGC CCT GTC TCT ACAAAAT AC AGAG AGAAAAAATCAGCC AG TC A 
TGGT GGCATACACCTGTAGTCCCAG CAT TC CGGG AGGCT GAGGTGGGAGG AT CACT T G AGCCC AGGGAGGT 
T GGGGC TGCAGT GAGC C ATGAT CACACCACTGCACT C CAGCC AGGT GAC AT AGC GAGATC CTGT CT AAAAA 
AATAAAAAAT AAAT AATGGAACAC AGC AAG TCCTAGGAAG TAGGT T AAAACT AAT TCT TT AA 



App_ID=10063537 



Page 267 of 3 18 



FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFT I LGLPAMLQAVRALM I VG I VLGAI GLLVS I FALKC I RI GSMEDSAKANMTLTSG IMFI VS GL 
CAI AGVSVFANMLVTNFWMS TANMYTGMGGMVQTVQTRYTFGAAL FVGWVAGGLTL I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 

V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 

AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 

GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 

GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 

CAT CAGGATGCAGTGCAAAATCTAT GATT CCCTGCT GGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 

GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 

ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 

CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 

ACT CAATAGTGAATGTTGCCCAAAAACGT GAGCTTGGAGAAGCTCTCT ACTTAGGATGGACCACGGCA 

CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 

CAGATACT CGAT ACCT T CCCATC GCACAACCCAAAAAAGTT ATCACAC CGGAAAGAAGT CACCGAGCG 

TCTACT CCAGAAGTCAGTAT GTGTAGTTGTGTATGT TTTTT TAACTTT ACTAT AAAGCCAT GCAAAT G 

ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 

AAT C T TAAT T AC AGGAACT GT GC ATCAGC TAT T TAT GAT T CTAT AAGCTAT T T C AGCAGAAT GAGAT A 

TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 

CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 

ACGAC AT AGCAT TAT GT AC ATAGAT GAGT GTAACAT T TAT AT C T CAC AT AGAGACAT GC T T ATAT GGT 

TTTATTTAAAATGAAATGCCAGTCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA 

ATCATGGATAGGGTTGAAGAAGGTTAC TATTAATTGTTT AAAAACAGCT TAGGGAT TAAT GTC CTCCA 

T TT ATAAT GAAGATTAAAAT GAAGGCT TTAAT CAGC ATT GT AAAGGAAATT GAAT GGCT T T CT GAT AT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 

C TT G T GT ATT AAAT T AACAT T T T T AAAAC GCAGAT AT TT T GT CAAGG GG CT T T GC AT T C AAAC T GCT T 

T TCCAGGGCTATACTCAGAAGAAAGAT AAAAGT GTGATCTAAGAAAAAGTGATGGT T TTAGGAAAGTG 

AAAATATTTTTGTTTTT GTATTTGAAGAAGAATGATGCATTTTGACAAGAAATCATATAT GTATGGAT 

ATAT TTT AAT AAGTATT TGAGTACAGACT TT GAGGT T TCAT CAAT ATAAATAAAAGAGCAGAAAAATA 

TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 

GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 

CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 

ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 

TAT ACAT T TATATTAATAAATTGT ACATTTT T CTAATT 
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FIGURE 120 



MATHALE I AGL FLGGVGMVGTVAVTVMPQWRVSAFIENN I WFENFWEGLWMNCVRQANI RMQCK 
I YDSLLALS PDLQAARGLMCAASVMS FLAFMMAILGMKCTRCTGDNEKVKAH I LLTAG 1 1 FI I T G 
MWLI PVSWVANAI IRDFYNS IVNVAQKRELGEALYLGWTTALVL I VGGALFCCVFCCNEKS S S Y 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 

CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 

GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 

TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 

CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 

TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 

GAT TCAAAGGAGAAAAGGGGGAAT GT CT GAGGGAAAGC T T TGAGGAGT CC TGGACACCCAACTAC 

AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 

TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 

GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 

AT T G AAG C T AT AAT T TAT T T G G AC C AAG G AAGC C C T GAAAT GAAT T C AAC AAT T AAT AT T CAT C G 

CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 

TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 

ATTATTGAAGAACTACCAAAATAAATGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC 

T T GG AAT GGT T CAC T TAAAT GACAT T T T AAAT AAGT T TAT GTATACAT CT GAAT GAAAAGCAAAG 

C T AAAT AT G T T T AC AGAC C AAAG T G T GAT T T C AC AC T G T T T T T AAAT C TAG CAT TAT T CAT T T T G 

CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 

CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 

AAAAAATATAAAAGCTACCAAT CTTT GTACAAT TTGTAAATGTTAAGAAT T TTTT TTATATCTGT 

TAAAT AAAAAT TAT T TCCAACA 



FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVT PGT PGI PGRDGFKGEKGECLRES FEESWTPNYKQCSWS SLNYG IDLGKIAECT FTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 

GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 

CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 

AT T GCCT GGAAGAATACATCATGT TT T T CGATAAGAAGAAATT G T AGGAT CCAGT T T T T T T TT TA 

ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 

TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 

TGTTCTGCGTGGCTGGGAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 

TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 

GC CCCTAAGC CAAAGCAAAAGACCT AAGGACGACC T TT GAACAATACAAAGGATGGGT TTCAAT G 

TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 

CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 

ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 

AT AACAGCCT T CAAAAAC T TAAGTATAAT C AAT T TAAAGGGC T CAACCAGCT CACC T GGC TAT AC 

CT T GACCATAACC ATAT CAGCAATAT TGACGAAAAT GCT T T TAAT GGAATACGCAGAC TCAAAGA 

GCT GAT T C T TAGT T CCAATAGAAT CT CC TAT T T TC T T AACAAT ACCT T CAGACCT GTGAC AAAT T 

TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 

CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 

AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 

T CT T TGCT GGCATGATCAGACTCAAAGAACTT CACC TGGAGCACAAT CAAT T T T CCAAGC T CAAC 

CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 

CAT AGGACAGACCAT G T CC TGGACCT GGAGCT CCT TACAAAGGC T TGAT T TAT CAGGCAATGAGA 

TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 

TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 

TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 

T T AAAGG T C TAAGGGAGAATACAAT TATC TGT GCCAGTCCCAAAGAGCT GCAAGGAGTAAATGT G 

AT CGATGCAGT GAAGAAC TACAGCATCTGT GGCAAAAGTAC TACAGAGAGGTT T GAT CT GGCCAG 

GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 

TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 

TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 

CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 

GGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAATGAC TCCCAGCACCCAGGAAT T T TATGTA 

GAT TATAAACCCACCAACAC GGAGACCAGC GAGATGCTGC T GAAT GGGACGGGACCC TGCACC TA 

T AACAAATCGGGCTCCAGGGAG TGT GAGGTATGAAC CAT T GTGAT AA&AAGAGC T C T TAAAAGCT 

GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 

TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 

GGTCATTT TCC TCTCATACATAAT CAACCCATTGAAAT TTAAATACCACAATCAATGTGAAGCTT 

GAACTCCGGTT TAATATAATACCTATTGTATAAGACCC TT TACT GATTCCAT TAATGTCGCATTT 

GT T T TAAGATAAAAC T TC T T TCATAGGTAAAAAAAAAAA 



FIGURE 124 



MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
IARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 

WLKS FKGLRENT 1 1 CAS PKELQGVNVI DAVKNY S I CGKS TTERFDLARAL PKPT FKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 



CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGT TGGGAGACGGTGCAAGAGAAT CTGC CC CC TATAGGGGAAT GGT GC GCACAGCCC TAGGGAT C 
ATT GAAGAGGAAGGCT T TCTAAAGC T T T GGCAAGGAGT GACACCCGCCAT T TACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T T AG CC AAT C C AAC T GAC CT AG T GAAG G T T C AGAT GC AAATGGAAGGAAAAAGGAAAC T G GAAG G 
AAAACCAT T GCGAT TTCGTGGT GT ACAT CAT GCAT T TGCAAAAAT CT TAGC TGAAGGAGGAATAC 
GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
ACCACTT ATGATACAGTGAAACACTACT TGGT AT TGAATACACCACTTGAGGACAATATCATGAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
TCAAAAGCAGAATAAT GAAT C AACC ACG AGATAAACAAGGAAGGGGAC T T T T GT AT AAATC AT CG 
ACT GACT GCTTGATTCAGGCTGT TCAAGGTGAAGGATTCATGAGT CTATATAAAGGCT TT TTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
T GAG TGGAGT CAGT CCAT T TTAA 
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FIGURE 126 



MSVPSEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHVVYSGGRMVTYEHLREWFGKSEDEHYPLWK^ 
VIGGl^GVIGQFI^PTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKILAEGGIRGLWAGWVP 
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 

CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 

AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 

CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 

GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCGGGGACTTTGATCCCCTACAGATGTGACCTAT 

CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 

TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 

GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 

AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 

CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 

GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 

AGACACAATTCGCCTTCAAACTCCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA 

ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 

CATCCAGAT TGGAGACATCCAGATGAGGC CCACGGAGCAGGT GACC TAGT GACTGTGGGAGC T CC 

TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 

CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 

TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 

TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 

TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 

TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 

AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 

AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 

CAGAACACCAGGGCCTGGCCCAGTGGATT TCATGGTGATCAT TAAAAAAGAAAAATCGCAACCAA 

AAAAAAAAAA 



FIGURE 128 



MARPGMERWRDRLALVTGASGGIGAAVA 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGIARPDTLLS 

EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
I S PG WE T Q FAFKLHDKDPEKAAAT YE QMKC L K PE DVAEAVI YVL S T PAH I Q I GD I QMR P T E QVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short- chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 



AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGGCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
TTAAT TT TTAACTGATAGT TGTACATAT TTGGGGGT ACATGTGATATT TGGATACATGTATACAA 
T AT AT AATGAT CAAATCAGGGTAAC T GGGAT ATCCATCACAT CAAACAT T TAT T T TT TAT T C T T T 
T T AGACAGAGTC TC AC TC T GT CACC CAGGCT GGAGTGCAG T GGT GC C AT C T CAGCT TAC TGCAAC 
C T C T GCCT GCCAGGT TCAAGCGATT GT CAT GCCTCCACC T CCCAAGTAGC TGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCT CT GAT CACC TCAT T CTAC T C T C TACC TCCAT GAGAT CCAC T T T T T T AGCT CCC ACAT GT G 
AGTAAGAAAATGCAATAT T TGTC T T T CT GT GCC TGGC T TAT T T CAC T T AACATAAT GACT T CC T G 

T TCCATCCATGTTGCTGCAAATGACAGGAT TTCGTTCT TAATTT CAAT TAAAATAACCACACATG 

GCAAAAA 



FIGURE ISO 

MGLLLLVLFLSLLPVAYT IMSLPPS FDCGPFRCRVSVAREHLPSRGSLLRGPRPRI PVLVSCQPV 
KGHGTLGES PMPFKRVFCQDGNVRS FCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature* 

amino acids 68-79 
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FIGURE 131 



TT CT GAAGT AAC GGAAGCT ACC TT GT AT AAAGAC CT CAAC ACT GCT GACC ATGATC AG CGCAGC CTGG AGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT AC C AGAGG AT GCT ACAACTCTC T ACCTT CAGAAC AACCAAAT AAAT AAT GCT GGGATT CC TTCAGAT 
TTGAAAAACTTGCTGAAAGTAGAAAGAATATACCTATACCACAACAGTTTAGATGAATTTCCTACCAACCT 
CCCAAAGT AT GT AAAAGAGT T ACATT TGCAAG AAAATAAC AT AAGG ACT ATC ACTT AT GAT T CACTT T CAA 
AAATT CCC T ATCTGG AAG AATT AC AT TT AGAT GACAAC TCTGTCT C TGCAGT T AGC AT AG AAGAGGG AGC A 
TT CCGAGAC AGC AACT AT CT CCGACT GCT T TT CCT GTCCCGT AATC ACCT TAGC AC AATT CCCT GGGGTT T 
GCCCAGGACTATAGAAGAACTACGCTTGGATGATAATCGCATATCCACTATTTCATCACCATCTCTTCAAG 

GT CTCACT AGTCTAAAACGCCTGGTT CT AGAT GGAAACCTGT T G AACAAT CAT GGT TT AGGT GACAAAGT T 
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 
CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 
GATG AT T T GGAC AAT ATAAC AC AACT GAT T CT T CGC AACAAT CC CTGGT AT TGC GGGT GC AAGATG AAAT G 
GGT ACGTG AC TGGT T AC AAT CACT AC C T GT GAAGGT CAACGT GC GTGGGCTC AT GT GCCAAGCCCC AG AAA 
AGGT TC GT GGGATGGC T AT T AAGGATCT C AAT GC AG AACT GT TT GATT GT AAGG AC AGT GGG ATTGT AAGC 
ACC ATT CAGAT AACCACT GC AATACC C AAC AC AGTGT AT CCT GCCCAAGG ACAGTGGCC AGCTCCAGT GAC 
CAAACAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA 
CAATTACAATTACTGTGAAGTCTGTCACCTCTGATACCATTCATATCTCTTGGAAACTTGCTCTACCTATG 
ACTGCT TTGAGACTC AGC TGGCTT AAAC T G GGCC AT AGC C CGGC AT TT GGATCT AT AACAGAAAC AAT TGT 
AACAGGGGAACGCAGTGAGTACTTGGTCACAGCCCTGGAGCCTGATTCACCCTATAAAGTATGCATGGTTC 
CC AT GG AAACCAGCAACCTCT ACCT ATT TGATGAAACTCCTGTT TGT AT T GAGACT GAAACT GC ACCC CTT 
CGAATGTACAACCCTACAACCACCCTCAATCGAGAGCAAGAGAAAGAACCTTACAAAAACCCCAATTTACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
T TCATAGG AATG GAT C GCT CTT CT CAAGGAACT GTGCAT ATAGC AAAGGGAGGAGAAG AAAGG ATG ACT AT 
GCAGAAGCTGGCACTAAGAAGGACAACTCTATCCTGGAAATCAGGGAAACTTCTTTTCAGATGTTACCAAT 

AAGCAAT GAACCCAT CTCGAAGGAGGAGTT T GT AAT ACAC AC CATATT T CCT CCTAAT GG AATGAAT C TGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TC AC ACTC ATGATGCT GAAG GACT C AC AGCAGAC TT GT GT TT TGGG TT T TTT AAACCT AAGGG AGGT GAT G 
GT 



FIGURE 132 



MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EE LHLDDNSVSAVS IEEGAFRDSNYLRLLFLS RNHLS T I PWGLPRT IEELRLDDNRI S T I S S P SL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMS^LSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVST IQITTAI PNTVYPAQGQWP APVTKQPD 
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAIIGGAVALVTIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 
amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites . 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
T GC AGAGGCAGTC T GGGC T T GGCC AGAGC T C AGGGTGC TGAGC GT GT GACCAGCAGT GAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
C TCCAC TACC T CAAGCTGAGTGATCCCAAGTACCTAAGAGAGT T CCAGCT GACCC TCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 

CCCAGGACTCAT TCT CAGAGGAGAGAAGT GACGTGT GCCTGGT GCAGCTGC T GGGAACCGGGAC G 

GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 

AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 

GACCACTCCAACAGAGCCAGGACTATATCAACCTCTTCTGCGCCAACATGATGGACTTGAACCGC 

AGAGCTGAGGGCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 

TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 

AACAGCAGGAAGGATGCTTCGGGGAGCCTGATGCTGAAGATGAAGAATTATCTAAAGCTATTCAA 

TATCAGCAGCATTTTTCGAGGAGAGTGAAGAGGCGAGAAAAACAATTTGCAGATTCTCGCTCTGT 

TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 

AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 

TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 

CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 

GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 

ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 

GT GGC T T CC TATACAT CCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACC G 

CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 

T T TAGT CC T CATCCC T TAGATCC T GGAGGGC ACGGAT C ACAT C CT GGGAAG AAGGC ATC T GGAGG 

ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 

GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 



FIGURE 134 

MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1 . 

amino acids 230-237 
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FIGURE 135 

GGTCTGAGTGCAGAGCTGCTGTCATCGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCT TACAGAT TTGATCCCGTT CGAGTGGATATCACTTCGAAAGGAAAAAT GAGAGCAAGATATG 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CTGACATGAGACGGGAAATGGAGCAGTCAATGAATATGCTGAATTCCAACCATGAGTTGCCTGAT 
GT T T C TGAGTT CATGACAAGACT CT T CT CT TCAAAATCAT C TGGCAAAT CT AGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGG TAG TCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGT CAAT AT T GATGTCAC TGAAT T AAT T ACAG TGTCCTATAGAAAAT GCCAT TAATAAAT TAT 
AT GAACTACT ATACAT TAT GTATATT AATT AAAACAT CT TAATCCAGAAAT CAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 



Ill 



HI 



FIGURE 1*6 

MZU\ALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGSFVVHDIPSGSYVVEWSPAYRFDPVRVDITSKGKMRARYVNYIKTSE 
WRLPYPLQMKS S GPP S Y F IKRES WGWT D FLMNPMVMMMVLPLL I FVLLPKWNT SDPDMRREME 
QSMNMLNSNHEL PDVSE FMTRLFS SKS S GKS S SGS SKTGKS GAGKRR 



Important features of the protein: 
Signal sequence: 

amino acids 1-23 



0 
. D 

0! Transmembrane domain: 



amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 



U3 Glycosaminoglycan attachment sites. 

P\ 

:H amino acids . 37-40, 236-239 

I'll 
m 

"rf j cAMP- and cGMP -dependent protein kinase phosphorylation site. 



amino acids 151-154 
N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 



ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 

T GCAAGCCAAAGATGGAAACAT T GACAT CAGAAT C T TAAGGAGGACT GAGT CT T T GCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
CCAT CAAGAAGGACC TCCGGCT CT CT CATGCC CACATGACATGCCAT T GT GGGGAGGAAGCAAT G 
AAGAAATACAGCCAGATT C TGAGT CAC T T T GAAAAGC TGGAACCT CAGGCAGCAGT TGTGAAGGC 
TT TGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGAATAGGAGGAAAGT GATGCTG 
CTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA 
ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
T GTAATATC TT TCTGC TAT TGG ATAT AT T TAT T AGT T AAT ATAT T TAT T TAT T T T T T GC TAT T T A 
AT GTATTTATTTTTT T AC T T G GACAT G AAAC T T T AAAAAAAT T C AC AG AT TAT AT T T AT AAC C T G 
ACTAGAGCAGGTGATGTATTTTTATACAGTAAAAAAAAAAAACCTTGTAAATTCTAGAAGAGTGG 
CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
AT T T GAAAT T GAACCAATGAC TACT T AGGAT GGGT T GT GGAATAAGT T T T G AT GTGGAAT T GCAC 
AT CTACCT TACAAT TACT GACCATCCCCAGTAGACTCCCCAGTCCCATAAT TGTGTATCTTCCAG 
CCAGGAATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAA 

AAAAAAAAAAA 



FIGURE 138 



MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 

SSRE AWQ FFLLLWS PDFRPKMKAS S LAFSLLS AAFYLLWT PS T GLKT LNLGS C VI ATNLQE I RNG 

FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 

SIANSFLTIKKDLRLSHAHMTCHCGEEMKKYSQILSHFEKLEPQAAVVKALGELDIL 

E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 

CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
T TT G T GGC TGAAGCAAAGCCACC T ACAGT TAC TAT GCCTCGAAT CAAGGCATT GATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
T AAGACAAGCAAAAGCAGC TGGAAAAAGAATAGTCT T T TAT GGAGATGAAACCT GGGT TAAAT TA 
T T CCCAAAGCAT T T T GT GGAATATGATGGAACAACC T CATT TT T CGT GTCAGAT TACACAGAGGT 
M GGATAAT AAT GT CACGAGGCAT T TGGATAAAGTAT TAAAAAGAGGAGAT T GGGACAT AT TAATCC 

Pi TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
»-* AAGCT GAGCGAGATGGACAGCGTGCT GATGAAGATCCACACCTCACTGCAGTCGAAGGAGAGAGA 

•;p GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
"M| GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
III CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
!|1 AC T T GGC T T ACCGAT T CCAAAAGACAGT GT AGGGAGCCTCC TAT TCC CAGT T GT GGAAGGAAGAC 

Ul CAAT GAGAGAGCAGT TGAGAT T T T TACAT T TGAATACAGTGCAGC TT AGTAAACT GT T GCAAGAG 

*'{ AAT GT GCC GT CATATGAAAAAGAT CC TGGGTT T GAGCAGT T TAAAAT G T CAGAAAGAT T GCAT GG 

^ GAAC T G GAT C AGAC T G TAC T T G GAGGAAAAGCAT T C AGAAG TC C TAT T CAAC C T G GGC T CCAAGG 

B TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
p TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
Iff CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
pi CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
!j* GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
W AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
|;| TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
fif CAAGGTGAT TGTAAAGAGC TGGCGGT CACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 

TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACT CAGAT CCACAGAGCCCAGGAT CAA.GGGACCCACTGCAGTGGCAGCAGGAC T GT TGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGC TT TGTCATCAGGGCACAGGC TTCCT CGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGCACACAGTATGTAGTTACCAAAAGAATAAACGGCAATAATTGAGAAAAAAAA 
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FIGURE 14Q 



rii 
in 

f!l 



MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKVVIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 



M Signal peptide: 

amino acids 1-34 

0! 

w 
m 



Transmembrane domain: 

amino acids 58-76 



N-glycosylation sites. 

pi amino acids 56-60, 194-198 

Si 

fj| N-myristoylation sites. 



amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 



GGCACGAGGCAAGCC TT CCAGG T TATCGT GACGCACCT T GAAAGT C T GAGAGC TAC TGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTGAT 
AGCAGT T CCAGATAAAAACTACATACGCCCAGAGATC TT C T TT GCAT TAGC CT CATCC T TGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
T GT GACAAGGATAAAGGACAAAGT CATC CATCCC T T CAGC TGAAGAAGGAGAAACTGATGAAGC T 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
jpt ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
p! G T T GGGGT GACAGATAAAT T TGAGAACAGGAAACACAT T GAAT T T T CATT T CAACCAGTT TGCAA 

III AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 

P f 5 AT T T GAACT AAT TGTATAAAAACAC CAAACCT GC TCACT 

III 

M 

m 

Ill 



App_ID=10063537 



Page 291 of 3 18 



FIGURE 142 

MLLLLLEYNFP I ENNCQHLKTTHT FRVKNLNPKKFS I HDQDHKVLVLDS GNLI AVPDKNYI RPE I 
FFALAS S LS SAS AEKG S P I LLGVS KGE FCLYCDKDKGQS HP S LQLKKEKLMKLAAQKE SARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

CAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 



N-myristoylation site. 

amino acids 50-55, 87-92 



m 

|l| Interleukin-l 

III amino acids 37-132 

III 
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FIGURE 143 

CTAGAGAGTATAGGGCAGAAGGAT GGCAGATGAGT GACT CCACAT CCAGAGCTGCCT CCC T TTAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG 
CACAT CTT GAAT TCCTGGAAGGAGCAGGTGGAGTCCAAGACT GTAT TCTCAATGGAGC TACTGCT 
GGGGAGAACTAGGTGTGGGAAATTTGAAGACGACATTGACAACTGCCATTTCCAAGAAAGCACAG 
AGCT GAACAATACTT TCACCTGCTT CT TCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACATCT TGAGCC TAATCATGT AGT GTAGAT CAT TAAACATCAGCAT T T T AAGAAAAAAAAAAA 
AAA?W\AAAAAAAA7U\AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 144 

MLGLPWKGGLS WALLLLLLGS QI LLIYAWH FHEQRDCDEHNVMARYLPAT VE FAVHT FNQQSKDY 
YAYRLGH I LNS WKEQVE S KTVFSMELLLGRTRCGKFEDD I DNCHFQES TELNNT FTCFFT I STRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 

C T GT GCAGC T CGAGGC TCCAGAGGCACAC T CC AGAGAGAGCCAAGG T T CTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
H CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
?| ; | AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
kj CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
If! TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 

CCGTCACT CCAT TC TCAGCCCCTAGCAGAGCGTC TGGCACACT AGAT TAG T AGTAAAT GCT T GAT 

M 

GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG 
f*| GT GT AT TCCCG T T TTACAGATAAGGAAACT GAGGCCCAGAGAGC T GAAGT AC TGCACCCAGCATC 

HI ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACGCCAGGTTTTCTGCTCT 
P GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 

[jf CTCTAATGAAATTGTGAAAGCTCCATGTTTAGAAATAAATGAAAACACCTGA 

P 

ill 
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FIGURE 146 

MRKHLS WW WIJ^TVCMLL FS HL S AVQTRG I KHR I KWNRKAL PS TAQ I TS AQVAENRPGAF I KQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 7 6-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG 
TTTGGGGT 



AppJD=10063537 



Page 297 



FIGURE 148 

MFRSSLLFWPPLCLLSLFLL I LI SS I YSESCKLE I FHFACQWGRSLSLS FYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE 14Q 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGT GCAGAAAGACCTT TGAAT GAACAGATTGCT GAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
TGT TGATAACTT GAACCTGCT AAAGGCAATAACAGAAAAGGAAAAAATT GAGAAAGAAAGACAAT CTA 
TAAGAAGCTCCCCACTTGATAATAAGTTGAATGTGGAAGATGTTGATTCAACCAAGAATCGAAAACTG 
ATCGAT GAT TATGACT CTACTAAGAGT GGATT GGAT CAT AAAT TT CAAGATGATCCAGAT GGT CT TCA 
T CAACTAGACGGGACTCCTT TAACCGCTGAAGACAT TGT CCAT AAAATCGCTG CCAGGAT T TAT GAAG 
AAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC 
CAAGCACATACACTGGAAGAT GAAGTAGCAGAGGTT T TACAAAAATT AAT CT CAAAGGAAGCCAACAA 
TTAT GAGGAGGATCCCAATAAGCCCACAAGCTGGACT GAGAATCAGGCT GGAAAAATACCAGAGAAAG 
T GACT CCAATGGCAGCAATT CAAGATGGTCTTGCTAAGGGAGAAAACGAT GAAACAGTAT CTAACACA 
TTAACCTT GACAAAT GGCTTGGAAAGGAGAACTAAAACCTACAGTGAAGACAACTTT GAGGAACT CCA 
ATAT T TC CCAAAT TTCT ATGCGCTACT GAAAAGTAT T GATT CAGAAAAAGAAGCAAAAGAGAAAGAAA 
CACT GATTACTAT CATGAAAACACTGATTGACTTTGT GAAGAT GATGGT GAAATAT GGAACAAT ATCT 
CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATAT GGAAGCTT GAAGGATT CCACAAAA 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATT TATAGCAGCCTGTAAAAATGGCAAAAGAT CCAGGAGTCTTTCAA 
CTGT T TCAGAAAACATAATATAGCTTAAAACAC TT CTAAT T CT GT GAT TAAAATTTT TT GACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE ISO 



MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKI EKERQS I RS S PLDNKLNVEDVDS TKNRKL I DDYDS TKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKL I SKEANNYEEDPNKPTS WTENQAGKI PEKVTPMAAI QDGLAKGENDE T VS NTL TL TNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 



Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 
220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 




N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 



341, 369-372, 382-385, 386-389, 387-390 



N-myristoylation sites: 



fji 



amino acids 143-148, 239-244 
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FIGURE isi 

CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 

ATAATAACCAGC T T CTAGCTGGAGGGCT GCAT GCAGGGAAGGT CAT TAAAGGT GAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGT GGAGGAGACCCATGGCGGACAAT CAC TC T C TC T GCTC TCAGGACCCCCACGT CTGACT TAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 

G GG T CAG T AGC T C T C CAC AT GAAGT C C T G T CAC T CAC CAC T GT G CAG GAGAG GGAGGT G G T CAT A 
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
GAAATGGCTCGAGCTCAGAAGATAAAAGATAAGTAGGGTATGCTGATCCTCTTTTAAAAACCCAA 
GATACAATCAAAATCCCAGATGCTGGTCTCTATTCCCATGAAAAAGTGCTCATGACATATTGAGA 
AGACCTACTTACAAAGTGGCATATAT TGCAATT TATT TTAATTAAAAGATACCTAT TTATATATT 
T CT T TATAGAAAAAAGTC TGGAAGAGT TT ACTTCAAT T GTAGCAAT GT CAGGGT GG TGGCAG TAT 
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 

AT TCCT T GT ATAAAAATAAGAAAAGAAAT T AAT C T T GAGGTAAGCAGAGCAGACAT CATC TC TGA 
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
T T GTAGT AGTGAT CAGGAAACAGATCT CAGCAAAGCCAC TGAGGAGGAGGC T GTGC TGAG TT T GT 
GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
G T C T C T GCAG AT G TAG T TAG T T AAG AC AAGG TCAT GC T GGAT G AAGG T AGACC T AAAT T C AAT AT 
GACTGGT TT CCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGAC TATGTA 
AAGATGAAGGCAGAGATCGGAGT TTTGCAGCCACAAGCTAAGAAACACCAAGGAT TGTGGCAACC 
ATCAGAAGCT TGGAAGAGGCAAAGAAGAAT TCTTCCC TAGAGGCT TTAGAGGGATAACGGCTCTG 
C T GAAACC T T AATC TCAGAC TT CCAGCCTCC TGAACGAAGAAAGAATAAAT T TCGGC TGT TT TAA 
GCCACCAAGGATAAT TGGT TACAGCAGCT CTAGGAAAC TAATACAGCT GCTAAAAT GAT CCC TG T 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGATGGCATGCCACTTCCAAGATTAGGT TATAAAAGACACTGCAGCTT C 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACAT TAGTGAACCT AGAAGCAGAGACT C T GT GAGATAAT C GAT GT TTGTTGTTTT 
AAGTTGCTCAGT TT TGGTCTAACT TGTTATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 



FIGURE 152 

MVLSGALC FRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEE I SWPNRWLDASLS PVI LGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 

VRLT QLPENGGWNAP I TDFYFQQCD 
N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 153 

CTTCAGARCAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCA ATGG CCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGGCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGT TCGTCT CATT GGGGAGAAAC TGTT CCAC GGAGTCAGT AT GAGTGAGCGC T GC T 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
|J* TAT T GAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGC T GAAGGACACAGT GAAAAAGC 

ill TT GGAGAGAGT GGAGAGAT CAAAGCAATT GGAGAAC T GGAT T T GC T G T TT AT GT CTCT GAGAAAT 

Iff GCCT GCAT T TGA CCAGAGCAAAGCT GAAAAAT GAATAAC TAACC C CCTTTCCCT GCTAGAAATAA 

III CAATTAGATGCCCCAAAGCGAT T T T T T T TAACCAAAAGGAAGAT GGGAAGCCAAACTCCATCAT G 

if! 

r! ATGGGTGGAT TCCAAATGAACCCCTGCGT T AGTTACAAAGGAAACCAAT GCCACT TT TGTT TAT A 

111 

i'\ AGAC CAGAAGGTAGAC T T TC TAAGCAT AGAT AT T TAT TGATAACATT T CAT T GTAAC TGGT G T T C 

■<n 

s TATACACAGAAAACAATT TATTTTT TAAATAAT TGTC TTT TTCCATAAAAAAGATTACTTTCCAT 

p T C CT T TAGGGGAAAAAACCCCT AAAT AGCT T CATGT T TCCATAATCAGTAC T T TAT AT TTATAAA 

Hi TGTATTTATTATTATTATAAGACTGCATTTTATTTATATCATTTTATTAATATGGATTTATTTAT 

Hi AGAAACAT CAT T CGATAT T GCT ACT T GAGT G TAAGGC TAATAT T GAT AT TTATGACAATAAT TAT 

I'll 

gar* AGAGCT AT AACAT GT TTAT T TGACCT CAAT AAACAC T T GGATAT CCC 
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FIGURE 154 

MAALQKSVSSFLMGTLATSCLLLLA^ 

ADNNTDWLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
T CH I EGDDLH I QRNVQKLKDTVKKLGES GE I KAI GELDLL FMS LRNAC I 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGAC CCAGAT TAGGTGAGGACAGTTC T C T CAT T AGCC T T T T CCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCT GC T CTACCACAACCAGACT G TC T T CTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
TTGACATGGATGATTCTGAGGAGGAAGCTGTTATTGAATGTATAGAGATTTATCCAAATAAATAT 
C T T T AT T T AAAAAT G AAAAA 



FIGURE 156 

^ERPRLGEDS SL I SLFLQWAFIJU^^ 

RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 13 6-* 140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 



CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACT T GAGGGACC TCCGAGT AGAACC TGT TACAAC TAGT GT T GCAACAGGGGACTAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 

GAT T T GT GT GACGGGCAAAAGCAACT T CCAGT CC T ACAGC T GT G T GAGGT GCAAT TACACAGAGG 

CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 

GAGCTGAACACAGTCTATTTCATTGGGGCCCATAATATTCCTAATGCAAATATGAATGAAGATGG 

CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 

AGTGTGTCAAGGCCGGAAGCCTGTGGGAT CCGAACATCAC TGCT TGTAAGAAGAATGAGGAGACA 

GTAGAAGTGAACTTCACAACCACTCCCCTGGGAAACAGATACATGGCTCTTATCCAACACAGCAC 

TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAA.CGCGAGCTTCAGTGGTGA 

TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 

AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 

GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 

CAT GGGT GCT GGTGGCAGGGAT CTAT C TAATGT GGAGGCACGAAAGGATCAAGAAGAC T TCCT T T 

TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 

TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA 

AGT GGCAGAAAAAGAAAATAGCAGAGAT GGGTCCAGTGCAGTGGCTT GCCACTCAAAAGAAGGCA 

GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 

GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 

GT GAT C T AAGAAGCCAGAT T CAT C T GC AC AAATACGTGG T GGT CT ACT T TAGAGAGAT T GATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTG TAG 



FIGURE is8 



MSLVLLSLAALCRSAVPRKPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSVNETSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQH 
ST IIGFSQVFEPHQKKQTRAS WI PVTGDSEGATVQLTP Y FPTCGSDC IRHKGT WLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYIMWRHERIKKTSFSTTTLLPPIFCVLVVYPSEICFHHTICY FTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
i,k GCCSL 

ra 

Li Important features of the protein: 

m 

f[ Signal peptide: 



m 
111 

r ft 



amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP -dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 



Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 

AGCCACCAGCGCAACATGACAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 

C T T T T T T CCAAAAGCC T GAGAGTT GCCCGCCT GT GCCAGGAGGT AGTAT GAAGC T T GACAT TGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 

AC T T GGGCT GCAT CAAT GCT CAAGGAAAGGAAGACATC TCCAT GAAT T CCGT T C CCAT CCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 

ACTCAGCTGAAGAAG 
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FIGURE 160 



MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIE S RS T S PWNYTVT WDPNRYPSEWQAQCRNLGCINAQGKED I SMNS VP I QQETLW 
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 



f*J N-glycosylation site. 



amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



III 
ill 



ACACT GGCCAAACAAAAACGAAAGCACT CCGT GCTGGAAGTAGGAGGAGAGT CAGGAC TCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCC TGAAGATGAGGAAAAGT TT GGAGGAGCAGCTGACT CAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
i3 TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
CI GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
|fl ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCGGTTAGGA 
CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
III CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
y GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
^, AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGGTGACTCCCTGGGGCCTCTCAAAGA 
M CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
III GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
gj CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 



fiJ 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKY I HKRWALVWL ACLL FAAALSLI LLLKKDHAKGWLRLLKQDVRSGAAARG 
^ RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVWLLFSP 
Q GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
M LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

N 

Signal sequence: 

\*] amino acids 1-20 

w 

Transmembrane domain. 

F| amino acids 453-475 

t'*| N-glycosylation sites. 

01 amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 

Irl 334-337, 357-360, 391-394 

ru 

Glycosaminoglycan attachment site. 

amino acids 583-586 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 



N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAAGTTTGA 
AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCGAGACACGGTCTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
G T CAGT GCGGGAGGCC GGTCAGCCACCAAGAT GAC TGACAGGT T CAGCTCTCTGCAGCACAC T AC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T T C T AC CAC T TAGAGC T CCAGGTCAAC CGCACCTACCAAAT GCAC C T T GGAGGGAAGCAGAGAGA 
ATATGAGT T CT T CGGC C T GACCCCT GACACAGAGT T CC T TGGCACCAT CATGAT TTGCGT T CCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
& TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
III T CCACAGC GGCATAGCC TGTCCGAGAT CACC TAC T T AGGGCAGCCAGAC AT CTCC AT CC TCCAGC 

If! CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
CACC T T AGGCC TAAAGGTCAGC TT CAGAAAGAGCCACCAGC T GGAAGCT GCATGT TAGGT GGCCT 
TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
|P C CCTGGGGAT T T GCACAGACAGAACAT CT GACCCAAATGTGCTACACAGT GGGGAGGAAGGGACA 

CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
II TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
Is AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGT GCAAGGAGGAAATGCAGGGAAAC TCCCGAGGT CCAGAGCC CCACC TC 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCT GGAAAAGAACCAGGCC TGGAAAAGAACCAGAAGGAGGCT GGGCAGAAC CAGAACAACCT GC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAAT GCAGGTCCAC CAGGGAGGGAGACACACAAGCC T TT TC T GCAGGCAGGAG T T TCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAAT TCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 

AT CT GT AAT GT GGGGATCATAACACCT ACCT CAT GGAGT TGT GG T GAAGAT GAAAT GAAGTCATG 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 

AAAAAAAA 



rt 



en 
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FIGURE 164 
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MRTLLTILTVGSIJy^HAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence, 
amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 



N-glycosylation sites. 

m amino acids 80-83, 87-90, 172-175 



N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 



TGGCCTACTGGAAA?\AAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAA C^TGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGTGT TGAACACAGT T T TGGATATT T T CCAAAAGAT T T GATCAAGGTACT TCAT AAATACACGGA 
AGAAGAGCTACATATTCCAGCAGATGAGACAGACTTTGTCTGCTTTGAAGGAGGAAGAGATGATT 
T T AATAG TT AT AAT GTAGAAGAGCT T T TAGGAT C T T TGGAAC T GGAGGAC T C TGTACC TGAAGAG 



O T CGAAGAAAGCTGAAGAAGT TT CT CAGCACAGAGAGAAAT C T CCT GAGGAGT C T CGGGGGCGTGA 



ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGTTATT 
CATTACAGCAAAGGATT TCGTTGGCATCAAAATCTAAGT TTGT TTTACAAAGATTGTTT TTAGTA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 

MAAAPGLLFWLFVLGALWWPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHS FGYFPKDL IKVLHKYTEEELHI PADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMS QRGS GQCVI HYSKGFRWHQNLSLFYKDC F 



Important features of the protein: 



p Signal peptide: 

13 amino acids 1-22 



U 5 



til 



p. 

p 



HI 



N-glycosylation site. 



I amino acids 2 94-298 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 



Amidation site* 

amino acids 28-32 
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FIGURE 167 

CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCT T GAGAATGAAGCAGAACAGAAGT T AATAGAGAGCAT GTT GCAAAAC CT GACAAAACCCGGG 
ACAGGGAT TT CTGATGGTGATT TCTGGATAGGGCTT TGGAGGAAT GGAGATGGGCAAACATCTGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAASAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
CT T GGGGGT CCC TACC T T TACCAGT GGAATGATGACAGGT GTAACAT GAAGCACAAT TAT AT T T G 
CAAG TAT GAACCAGAGAT TAAT CCAACAGCCCCT GTAGAAAAGCC T TAT CT TACAAAT CAACCAG 
GAGACACCCATCAGAATGTGGT TGTTACTGAAGCAGGTATAAT TCCCAAT CTAATTTATGT TGT T 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
GCATAAAAG TAAAGGAAGAACAAAAAC TAGT CCAAACCAGTCTACAC T GT G GAT TT CAAAGAGTA 
CCAGAAAAGAAAGT GGCATGGAAG TATAAT AAC TCATTGAC T T GGT T CCAGAAT T T T GTAAT T CT 
GGAT CT GT AT AAGGAAT GGCAT CAGAACAATAGC T TGGAATGGC T TGAAAT CACAAAGGATC TGC 
AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
TTTAAAGAATATGCTGTGCTAATAATGGAGT GAGACATGC TTAT TT TGCTAAAGGATGCACCCAA 
ACT TCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGT TATCAACACGTCGGGAGTA 
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 

TGTATAT T GT ATT GAAAT T TACAGTGTGCAAAAGTAT T T TACCT T TGCAT AAGT GT T TGATAAAA 
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAACT TAT T AC TGT T GTCAAC T GAAT TCACACACACACAAATAT AG TACCAT AGAAAAAG T T TGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AATAAGAAGCTATT TCATTAAGTGT GATATAAACCTCCTCAAACATT TTACTTAGAGGCAAGGAT 
TGT C TAAT TT CAAT T GT GCAAGACAT GT GCCT TATAAT TAT TT T T AGCTTAAAAT TAAACAGAT T 
T TGTAATAAT G TAAC TT T GT TAAT AGGT GCATAAACACTAATGCAGT CAAT T TGAACAAAAGAAG 
TGACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACT AT AT TAGTATACAAAGAGGT CAT GT GGT TGAGACCAGGTGAATAGT CAC TAT CAGT GTGGAG 
ACAAGCACAGCACACAGACATT T TAGGAAGGAAAGGAAC TACGAAAT CGT GT GAAAATGGGT TGG 
AAC CCAT CAGT GATCGCATATT CAT T GAT GAGGGTT TGCTTGAGATAGAAAATGGT GGC TCCT T T 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCWMYHQPTANPGLGGPYLYQWNDDRCNMKHNYI CKYEPE INPTAPVEK 
PYLTNQPGDTHQNWVTEAG 1 1 PNL I Y WI PT I PLLLL I LVAFGTCC FQMLHKS KGRTKTS PNQ 
STLWISKSTRKESGMEV 



Important features of the protein: 
Signal peptide: 
amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89, 255-258 



cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 266-269 

N-myristoylation sites* 

fJJ amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 

145, 212-217 
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